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In  this  dissertation  we  propose  a feature-oriented  label 

graph  isomorphism  algorithm  to  solve  the  label  graph 

isomorphism  problem.  Almost  all  graph  isomorphism 

algorithms  proposed  so  far  deal  with  the  non-label  graph 

isomorphism  problem.  However,  one  major  drawback  of  all 

non-label  graph  isomorphism  algorithms  proposed  so  far,  even 

if  based  on  different  techniques,  is  that  they  are  generally 

under-constrained.  Features  and  primitives,  which  can  be 

easily  fetched  in  a graph,  are  not  used  in  the  non-label 

graph  isomorphism  algorithm.  This  limitation  makes  the 

non-label  graph  isomorphism  algorithm  incapable  of  resolving 

the  label  graph  isomorphism  problem.  In  order  to  alleviate 

this  problem,  a feature-oriented  label  graph  isomorphism 

algorithm  utilizing  the  vertex  and  edge  features  of  a graph 


vi 


is  studied  and  developed  in  this  dissertation.  The 
developed  algorithm  extracts  the  vertex  and  edge  features 
from  the  graph  and  applies  them  to  the  computation  of 
invariants  with  higher  levels  of  differentiating  power. 
Conseguently , the  developed  algorithm  is  capable  of 
resolving  the  label  graph  isomorphism  problem  that  the  non- 
label graph  isomorphism  algorithm  cannot  resolve.  We  have 
successfully  applied  the  proposed  graph  isomorphism 
algorithm  to  two  engineering  problems  and  demonstrated 
encouraging  results  in  these  two  applications. 
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CHAPTER  I 
INTRODUCTION 

1.1  Motivation 

A graph  in  which  each  vertex  and/or  edge  is  assigned  a 
label  is  called  a label  graph.  For  instance,  the  three 

pairs  of  graphs  in  Figures  1.1-1. 3 are  label  graphs  because 
the  vertices  and/or  edges  are  labeled.  In  these  three  pairs 
of  graphs,  the  label  of  a vertex  is  the  feature  or  property 
(such  as  vertex  degree,  vertex  type,  chemical  atom,  etc.) 
associated  with  the  vertex,  and  the  label  of  an  edge  is  the 
feature  or  property  (such  as  electronic  component,  chemical 
bond,  etc.)  associated  with  the  edge. 

With  the  introduction  of  the  label  graph,  we  now  can 
define  the  label  graph  isomorphism.  Two  label  graphs  = 
(V^,E^)  and  = (V^,E^)  are  isomorphic  if  there  is  a one- 
to-one  correspondence  between  the  vertices  of  G^  and  G^ 
which  preserves  a one-to-one  correspondence  between  the 
edges  of  the  graphs.  If  an  isomorphism  exists  between  two 
label  graphs,  then  the  label  graphs  are  isomorphic  and  the 
problem  of  determining  whether  two  given  label  graphs  are 
isomorphic  is  termed  the  label  graph  isomorphism  problem. 

Almost  all  graph  isomorphism  algorithms  proposed  so  far 
deal  with  the  non-label  graph  isomorphism  problem.  However, 
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Figure  1.1  a)  Circuit  Diagram  1,  b)  Circuit  Diagram  2, 
c)  C-Graph  1,  d)  C-Graph  2. 
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Figure  1.2 
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a)  Pictorial  Graph  1,  b)  Pictorial  Graph  2. 


4 


Figure  1.3 


a)  Chemical  Molecule  1,  b)  Chemical  Molecule  2, 
c)  Chemical  Graph  1,  d)  Chemical  Graph  2. 
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one  major  drawback  of  these  non-label  graph  isomorphism 
algorithms  proposed  so  far,  even  if  based  on  different 
techniques,  is  that  they  are  generally  under-constrained. 
Features  and  properties,  which  can  be  easily  fetched  in  a 
graph,  are  not  used  in  the  non-label  graph  isomorphism 
algorithms.  This  constraint  makes  the  non-label  graph 
isomorphism  algorithms  incapable  of  resolving  the  label 
graph  isomorphism  problem.  To  illustrate  the  limitation 
imposed  by  the  non-label  graph  isomorphism  algorithms,  let 
us  consider  the  label  graphs  shown  in  Figures  1.1-1. 3. 
Without  considering  the  vertex  and/or  edge  labels 
associated  with  each  vertex  and/or  edge  in  the  graphs,  the 
non-label  graph  isomorphism  algorithms  will  recognize  these 
three  pairs  of  label  graphs  as  isomorphic.  In  fact,  these 
three  pairs  of  label  graphs  are  non-isomorphic.  They  are 
the  representative  graphs  of  different  circuits,  pictorial 
drawings,  and  chemical  molecules  with  different  physical 
meanings.  The  non-label  graph  isomorphism  algorithms  fail 
to  resolve  the  label  graph  isomorphism  problem  because  the 
vertex  and/or  edge  labels  are  not  considered  in  the 
algorithms.  In  order  to  remedy  this  problem,  we  propose  a 
label  graph  isomorphism  algorithm  utilizing  the  vertex 
and/or  edge  features  presented  in  the  graph  such  that  it  can 
resolve  the  label  graph  isomorphism  problem  that  the  non- 
label graph  isomorphism  algorithm  cannot  resolve. 
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1.2  Statement  of  Problem 

In  this  dissertation,  we  propose  a feature-oriented 
label  graph  isomorphism  algorithm  which  is  effective  for  a 
large  class  of  label  graphs  and  can  be  applied  to  the 
problems  that  can  be  formulated  as  the  label  graph 

isomorphism  problem.  The  problem  we  want  to  solve,  given  an 

object  graph  and  a set  of  model  graphs  stored  in  the 

library,  is  to  determine  which  model  graph  in  the  library  is 
isomorphic  to  the  object  graph;  and  if  such  an  isomorphism 
exists,  delineate  the  vertex-to-vertex  correspondence 
between  these  two  graphs. 

In  this  dissertation,  all  graphs  are  assumed  to  be  label 
graphs,  so  that  every  vertex  and/or  edge  is  assigned  a 
label.  Two  label  graph  configurations  are  considered  in 
this  dissertation,  they  are  the  label  graphs  under 

isomorphism  and  the  label  graphs  under  pictorial  rotation, 
reflection,  and  shear.  In  the  second  case,  a label  graph 
is  the  graph  representation  of  its  corresponding  picture. 
The  pictures  under  rotation,  reflection,  and  shear  are 
compared  for  isomorphism. 

1.3  Usefulness  of  Graph  Isomorphism  Algorithm 

The  graph  isomorphism  problem  arises  in  a number  of 
fields  in  which  graph  theory  is  employed,  such  as  chemistry. 
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switching  theory,  information  retrieval,  linguistics,  and 
network  theory. 

One  of  the  most  widespread  uses  of  graph  isomorphism 
algorithm  in  chemical  information  systems  is  in  the 
matching  of  chemical  structures  (Sussenguth  [1965],  Morgan 
[1965],  Ming  and  Tauber  [1971],  Balaban  [1975]).  Looking 
for  the  name  and  other  information  associated  with  a 
particular  structure  involves  the  matching  of  the  structure 
against  a library  of  known  structures.  A chemical  compound 
is  represented  by  its  atomic  structure  diagram  (an 
undirected  graph  with  labeled  vertices  and  labeled  edges) . 
A given  compound  is  matched  against  the  compounds  contained 
in  a large  file  or  library.  A graph  isomorphism  indicates 
that  the  given  compound  is  already  in  the  library.  Due  to 
the  nature  of  chemical  compounds,  it  is  natural  to  map  them 
onto  the  graph  structure  for  convenience  of  manipulation. 
One  of  the  main  advantages  of  representing  a chemical 
compound  by  the  graph  is  that  the  description  can  be  in 
terms  of  the  constituent  primitives  of  the  compound.  In 
this  case  the  primitives  are  the  atoms  present  in  the 
compound  of  interest. 

Other  fields  in  which  the  graph  isomorphism  problem 
occurs  are  design  automation  and  information  retrieval. 
Weinberg  [1966]  developed  algorithms  in  determining  whether 
two  planar  triply  connected  graphs  were  isomorphic  and 
applied  the  algorithms  for  efficient  layout  of 
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microelectronic  circuits.  An  application  in  the  field  of 
information  retrieval  is  the  matching  of  a query  structure 
against  a dictionary  structure  (Salton  [1964],  Salton  and 
Sussenguth  [1965]).  The  retrieval  system  would  search  for 
substructures  of  the  dictionary  structures  that  are 
isomorphic  to  the  query  structure.  Seshu  and  Reed  [1961] 
indicate  how  a graph  may  represent  an  electrical  network. 
One  possible  use  of  a graph  isomorphism  tester  is  to 
ascertain  whether  a given  network  has  been  previously 
patented.  This  could  be  achieved  by  comparing  the  given 
network  to  all  previously  patented  networks  (Cornog  and 
Bryan  [1966] ) . 


1.4  Previous  Research 

Some  of  the  graph  isomorphism  algorithms  proposed  so  far 
will  be  briefly  described  in  this  section. 

Let  us  first  look  at  a trivial  algorithm  known  as  vertex 
reordering  algorithm.  Given  two  graphs  G^=(V^,E^)  and 
g2=(v2,e^),  all  mappings  of  onto  are  considered  in  the 
vertex  reordering  algorithm.  Theoretically,  it  is  always 
possible  to  determine  whether  or  not  two  graphs  G^  and  G^ 
are  isomorphic  by  keeping  G^  fixed  and  reordering  vertices 
of  G^  to  check  if  their  adjacency  matrices  become  identical. 
In  this  algorithm  the  vertices  of  one  of  the  graphs,  say 
G^ , are  systematically  reordered,  and  each  graph  determined 
by  such  a reordering  is  checked  for  identity  with  G^.  G^  is 
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isomorphic  to  if  and  only  if  is  identical  to  at  least 
one  of  the  graphs  determined  by  a reordering.  This  process 
may  reguire  all  n!  reorderings  and  comparisons,  n being  the 
number  of  vertices.  This  is  essentially  implemented  in  the 
vertex-to-vertex  search,  an  early  systematic  attempt  to 
resolve  the  problem.  The  scheme  reguires  extensive 
bookkeeping  of  the  examined  possibilities,  involves 
considerable  backtracking,  and  becomes  impractical  for 
applications  to  graphs  of  medium  size  and  complexity.  Such 
an  inefficient  procedure,  in  which  the  running  time  grows 
factorially  with  n,  is  of  limited  use  for  practical 
problems.  It  has  been  calculated  that  such  an  algorithm 
implemented  on  a computer  could  reguire  75,000  years  to 
resolve  a pair  of  20  vertex  graphs  (Berztiss  [1973a]).  An 
algorithm  guaranteeing  a solution  in  running  time 
proportional  to  a constant  power  of  n is  desirable,  but  no 
such  algorithm  has  been  discovered  for  determining  if  two 
arbitrary  graphs  are  isomorphic. 

Alternative  schemes  considered  in  the  literature  attempt 
to  recognize  a singular  property  or  a selection  of 
properties  that  could  differentiate  between  non-isomorphic 
graphs.  Such  properties  include  the  graph  spectrum 
(Masuyama  [1973]),  distribution  of  valences  of  vertices  and 
discrimination  of  edge  types  (Goodman  and  Cummins  [1965], 
Levi  [1974]),  and  examination  of  subgraphs  and  their 
characteristic  polynomials  (Spialter  [1964],  Turner  [1968], 
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Kudo  [1973]).  However,  if  one  uses  any  combination  of 
criteria  such  as  mentioned  above,  one  cannot  be  sure  that 
all  pairs  of  graphs  would  be  differentiated  by  the  criteria. 
This  ultimate  uncertainty  undermines  such  efforts,  which  at 
best  provide  a list  of  necessary  conditions  for  isomorphism. 

Turner  [1968]  showed  that  point  symmetric  graphs  with  a 
prime  number  of  vertices  are  isomorphic  if  and  only  if  their 
respective  adjacency  matrices  have  the  same  eigenvalues.  We 
have  just  observed  that  an  adjacency  matrix  A of  a graph 
depends  on  how  the  graph  is  labeled.  A graph  can  have  as 
many  adjacency  matrices  as  there  are  ways  of  labeling  its 
vertices.  However,  the  determinant  |a  - ^l|  is  clearly  not 
altered  if  the  graph  is  relabeled,  and  hence  is  an  invariant 
of  the  graph.  Thus  the  characteristic  eguation  of  A is  also 
an  invariant  of  the  graph,  so  is  the  set  of  its  roots,  and 
so  is  the  set  of  eigenvectors. 

There  are  some  algorithms  (Parris  and  Read  [1967], 
Berztiss  [1973b],  Shah,  Davida,  and  McCarthy  [1974])  which 
are  based  on  the  property  that  two  graphs  are  isomorphic  if 
and  only  if  they  have  the  same  code,  i.e.,  a canonical  or 
standard  description  of  a given  graph.  The  object  of  the 
algorithm  proposed  by  Berztiss  [1973b]  is  to  generate  the  K- 
formula  for  and  attempt  to  derive  a similar  K-formula  to 
describe  using  the  transformations  of  the  K-operator. 
Successful  generation  of  a K-formula  for  G^  such  that  a 
consistent  substitution  of  vertices  of  G^  into  the  K-formula 
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for  produces  the  same  pattern  establishes  an  isomorphism. 
In  fact,  because  all  possible  choices  are  investigated,  all 
isomorphisms  can  be  found.  A second  graph  encoding 
algorithm  is  due  to  Shah,  Davida,  and  McCarthy  [1974].  This 
algorithm,  valid  only  for  undirected  graphs,  derives  from 
the  adjacency  matrix  a canonical  code  for  a graph.  The 
canonical  code  for  a graph  is  defined  as  the  binary  number 
formed  by  concatenation  on  successive  rows  of  the  upper 
triangle  of  the  adjacency  matrix.  This  code  is  canonical 
if  the  rows  and  corresponding  columns  of  the  adjacency 
matrix  are  permuted  so  that  the  resulting  binary  number  is 
maximum.  Backtracking  is  used  to  determine  the  canonical 
code.  If  the  canonical  code  for  two  graphs  are  the  same, 
then  the  mapping  specified  by  the  renumbering  is  an 
isomorphism. 

A number  of  graph  isomorphism  algorithms  have  been 
developed  using  heuristics  (Nagle  [1966],  Kuhn  [1971], 
Cornell  and  Kirkpatrick  [1980],  Dewdney  [1978]).  They  all 
obtain  an  initial  partition  based  on  the  necessary  condition 
that  the  degree  of  the  vertices  mapped  must  be  equal  if  the 
mapping  is  to  be  an  isomorphism.  Having  obtained  the 
initial  partition,  various  heuristics  based  on  one  or  more 
necessary  conditions  for  isomorphism  have  been  used.  The 
philosophy  of  a heuristic  is  to  exploit  necessary  conditions 
for  isomorphism  to  reduce  the  size  of  the  search  tree. 
Having  improved  the  partition,  the  heuristic  is  repeated. 


12 


If  the  search  tree  is  sufficiently  reduced,  all  remaining 
possibilities  may  be  examined.  Otherwise,  additional 
heuristics  must  be  used.  Generally,  the  heuristics  work 
reasonably  well  for  most  pairs  of  graphs  but  have  difficulty 
with  some  classes  of  graphs.  Many  existing  algorithms  seem 
to  encounter  the  most  difficulty  with  graphs  containing  a 
high  degree  of  symmetry  because  the  heuristics  cannot 
distinguish  between  vertices  with  similar  characteristics. 

Sussenguth  [1964]  described  a method  of  cataloging 
chemical  compounds  by  describing  the  structure  as  an 
undirected  graph.  In  his  model,  the  vertices  represent  the 
individual  atoms  and  the  edges  represent  the  valence  bonds 
between  pairs  of  atoms.  Given  a chemical  compound,  he 
searches  the  library  to  determine  if  its  graph  is  isomorphic 
to  any  graph  in  the  library.  His  system  also  considers 
whether  the  compound  is  a subcompound  of  the  library 
compound.  A subgraph  isomorphism  indicates  that  the  given 
compound  is  a subcompound  of  the  library  compound  under 
scrutiny;  a graph  isomorphism  indicates  that  the  given 
compound  is  already  in  the  library.  Sussenguth 's  algorithm 
is  based  on  two  principles.  First,  if  graphs  G^  and  G^  are 
isomorphic,  then  the  subset  of  vertices  of  G^  that  exhibit 
some  property  must  correspond  to  the  subset  of  vertices  of 
that  exhibit  the  same  property.  Second,  if  the  subsets 
of  vertices  of  G^  and  Qp-  that  are  characterized  by  some 
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property  do  not  have  the  same  number  of  elements,  then  the 
two  graphs  cannot  be  isomorphic. 

Unger's  [1964]  heuristic  of  successive  partitions  based 
on  the  class  relation  of  adjacent  vertices  represents  an 
attempt  to  consider  the  relation  of  a vertex  with  its 
neighbors.  He  first  uses  the  necessary  condition  that  if  a 
mapping  0 with  0(Va)  = vj^  is  to  be  an  isomorphism,  then  the 
positive  and  negative  degrees  of  the  vertices  must  be  the 
same.  This  initial  condition  can  reduce  the  size  of  the 
search  tree.  At  this  point,  Unger  introduces  a second 
condition.  Let  U^  denote  the  set  of  vertices  adjacent  to 
vertex  v^  and  Uj^  denote  the  set  of  vertices  adjacent  to 
vertex  vj^.  Assuming  v^  and  vjj  have  the  same  class 
assignment,  then  0(v^)  = vjj  may  be  an  isomorphism  only  if 
the  number  of  vertices  in  each  class  in  U^  is  equal  to  the 
number  of  vertices  in  the  respective  class  in  Uj^.  Unger 
requires  the  weaker  condition  that  the  sum  of  the  classes  of 
vertices  in  U^  must  equal  the  sum  of  classes  in  Uj^.  Most  of 
the  recent  algorithms  attempt  to  extend  this  idea  of 
categorizing  a vertex  by  the  company  it  keeps,  i.e.,  its 
adjacency  relationships. 

Corneil  and  Gotlieb's  [1970]  algorithm  attempted  to 
generate  a canonical  graph  which  is  a unique  homomorphic 
image  of  the  given  graph.  Vertices  are  partitioned  into 
classes  based  on  their  respective  degrees  much  as  other 
algorithms  do.  He  then  iteratively  refines  the  partitioning 
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by  considering  the  classes  of  the  vertices  to  which  each 
vertex  is  adjacent. 

1.5  Comparison  Between  Previous  Heuristic  Approach 

and  Our  Approach 

The  heuristic  approach  has  been  the  most  popular 
approach  in  solving  the  label  graph  isomorphism  problem  in 
the  past  three  decades  (Sussenguth  [1965],  Dewdney  [1978], 
Corneil  and  Kirkpatrick  [1980]).  Our  proposed  label  graph 
isomorphism  algorithm  is  based  on  the  heuristic  approach; 
however,  we  make  two  major  improvements  on  the  previous 
heuristic  approach.  The  first  improvement  of  our  proposed 
graph  isomorphism  algorithm  is  to  develop  an  iterative  set 
partitioning  scheme  to  effectively  partition  the  vertex  set 
into  subsets,  while  the  previous  heuristic  approach  uses  an 
inefficient  set  reduction  partitioning  scheme  to  partition 
the  vertex  set  (Sussenguth  [1964],  Ming  and  Tauber  [1971], 
Figueras  [1972]).  We  will  elaborate  on  this  topic  and  make 
a comparison  between  these  two  partitioning  schemes  in 
section  2.3.1. 

The  second  improvement  that  we  make  is  to  develop  a 
powerful  ambiguity  resolution  scheme.  The  ambiguity  occurs 
when  a unique  one-to-one  vertex  correspondence  cannot  be 
generated  by  further  partitioning  the  already  refined  sets. 
In  the  previous  heuristic  approach,  there  are  two  methods  to 
resolve  the  ambiguity.  The  first  method  is  to  exhaustively 
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enumerate  the  possible  vertex  mappings  and  the  second  method 
is  to  use  a tedious  higher-order  connectivity  search  (Penny 
[1965],  Figueras  [1972],  Deo  [1974]).  Both  of  these  two 
methods  use  brute  force  and  are  time-consuming  in  resolving 
the  ambiguity.  In  our  proposed  algorithm,  we  develop  a very 
straightforward  and  powerful  ambiguity  resolution  scheme  to 
resolve  the  ambiguity.  The  pairs  of  vertex  sets  are 
analyzed  and  partitioned  effectively  in  an  effort  to 
determine  a unique  one-to-one  correspondence  between 
vertices.  We  will  discuss  our  ambiguity  resolution  scheme 
in  section  5.2.4  and  compare  it  with  that  used  in  the 
previous  heuristic  approach. 

1.6  Outline  of  the  Remainder  of  the  Dissertation 

Chapter  II  introduces  the  proposed  feature-oriented 
label  graph  isomorphism  algorithm.  The  proposed  algorithm 
utilizing  a set  of  feature-oriented  invariants  in  resolving 
the  label  graph  isomorphism  problem.  The  algorithm, 
consisting  of  the  isomorphism  test  and  the  vertex 
partitioning,  is  especially  useful  when  one  has  many  graphs 
to  test.  The  proposed  algorithm  is  effective  for  a large 
class  of  label  graphs  and  terminates  either  providing  the 
isomorphism,  or  indicating  that  there  is  no  isomorphism  for 
the  given  pair  of  label  graphs. 

Chapter  III  analyzes  the  cycle  structure  of  a label 
graph.  Since  the  cycle  structure  of  a label  graph  often 
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gives  significant  clues  to  the  cyclic  information  of  the 
graph,  the  invariants  generated  based  on  the  cycle  analysis 
are  especially  useful  and  applicable  to  a cyclically 
involved  graph.  A set  of  cycle  invariants  are  derived  for 
each  vertex  based  on  the  cycles  to  which  it  belongs.  These 
invariants  provide  important  information  for  the  graphs 
under  pictorial  rotation,  reflection,  and  shear,  and  for  the 
graphs  under  isomorphism. 

Chapter  IV  presents  a canonical  encoding  technigue  to 
encode  a graph  into  a unique  canonical  representation.  This 
process  allows  all  known  graphs  to  be  compiled  into  their 
corresponding  canonical  representation.  The  canonical  code, 
which  is  a sufficient  condition  for  the  existence  of  an 
isomorphism,  and  the  canonical  string,  which  is  a necessary 
condition  for  the  existence  of  an  isomorphism,  are  then 
derived  based  on  the  encoded  canonical  representation.  The 
canonical  code  and  canonical  string  of  the  unknown  graph  is 
then  matched  against  the  canonical  codes  and  canonical 
strings  of  all  known  graphs.  If  any  two  graphs  have 
identical  canonical  codes,  then  we  can  conclude  that  the  two 
graphs  are  isomorphic  and  the  vertex  correspondence  between 
the  vertex  sets  of  these  two  graphs  are  obtained  by  pairing 
the  final  canonical  orderings  of  these  two  graphs.  As  to 
the  canonical  string,  it  can  be  used  as  an  invariant  in  the 
isomorphism  test  to  eliminate  those  non-isomorphic  graphs  at 
early  steps. 
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Chapter  V analyzes  the  connectivity  structure  of  a 
graph.  The  purpose  of  it  is  to  derive  the  connectivity 
invariants  of  graphs  and  vertices,  which  can  be  used  either 
as  tools  in  the  isomorphism  test  or  as  prime  criteria  in  the 
vertex  partitioning  to  resolve  the  ambiguity.  The  ambiguity 
occurs  when  a unigue  vertex  correspondence  cannot  be 
generated.  The  distance  matrix  derived  based  on  the 
connectivity  analysis  is  the  last-resort  method  to  resolve 
the  ambiguity  because  the  distance  matrix  characterizes  the 
connectivity  relationships  between  any  vertex  and  the  rest 
of  the  graph. 

In  Chapter  VI,  the  applications  of  the  proposed  graph 
isomorphism  algorithm  are  presented.  Two  applications  are 
reported  in  this  dissertation,  which  are  (1)  the  matching 
and  identification  of  circuit  diagrams  with  PC  boards,  and 
(2)  the  representation  of  pictorial  drawings  for  3-D 
recognition.  These  two  applications  are  implemented  on  a 
VAX  11/750  machine  and  demonstrate  successful  results. 

Chapter  VII  summarizes  the  major  works  reported  in  this 
dissertation,  and  provides  suggestions  for  further  research. 


CHAPTER  II 

THE  LABEL  GRAPH  ISOMORPHISM  ALGORITHM 

A feature-oriented  label  graph  isomorphism  algorithm  for 
solving  the  label  graph  isomorphism  problem  is  presented  in 
this  chapter.  The  developed  label  graph  isomorphism 
algorithm  extracts  the  vertex  and  edge  features  from  the 
label  graph  and  applies  them  to  the  computation  of  feature- 
oriented  invariants  with  higher  level  of  differentiating 
power  compared  to  that  of  the  general  invariants.  The 
proposed  algorithm  guarantees  to  lead  to  a solution  by 
employing  a set  of  feature-oriented  invariants  which  form  a 
necessary  and  sufficient  condition  for  the  existence  of  an 
isomorphism. 

The  heuristic  approach  has  been  the  most  popular 
approach  for  solving  the  label  graph  isomorphism  problem  in 
the  past  decade.  To  show  that  two  label  graphs  G^  and  G^ 
are  not  isomorphic,  the  usual  heuristic  approach  is  to 
compare  their  graph  invariants.  A graph  invariant  of  a 

graph  G is  a number  that  is  the  same  for  all  graphs 
isomorphic  to  G.  Some  examples  of  graph  invariants  are 
number  of  vertices,  number  of  edges,  connectivity,  and  so 
on.  If  two  graphs  G^  and  G^  have  a different  graph 
invariant,  then  we  conclude  that  these  two  graphs  are  not 
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isomorphic.  Therefore  the  graph  invariants  can  be  employed 
to  eliminate  those  non-isomorphic  graphs  at  early  steps  when 
one  has  many  graphs  to  test.  Numerous  heuristic  algorithms 
have  been  proposed  based  on  the  idea  that  if  you  compute 
many  graph  invariants,  and  if  they  are  the  same  for  both 
graphs  and  , it  is  likely  that  G^  and  G^  are 

isomorphic.  However,  two  graphs  having  the  same  graph 

invariants  do  not  guarantee  an  isomorphism.  The  last-resort 
method  to  ensure  the  isomorphism  is  to  delineate  the 
vertex-to-vertex  correspondence  between  G^  and  G^  by 
employing  a set  of  vertex  invariants.  A vertex  invariant  of 
a vertex  is  a number  that  is  invariant  for  the  vertex  under 
graph  isomorphism.  The  heuristic  algorithms  try  to  reduce 
the  number  of  possible  vertex  mappings  by  employing  vertex 
invariants  to  partition  the  vertex  sets  of  the  two  graphs. 
For  example,  by  applying  the  vertex  degree  to  the  vertex 
sets  of  a pair  of  graphs,  the  number  of  possible  vertex 
mappings  can  always  be  reduced  from  n!  to  jrn^! , where 

= n.  For  a graph  that  does  not  have  a large 
percentage  of  its  vertices  with  the  same  degree,  the  number 
Trnj^!  is  much  smaller  than  n!  . The  underlying  idea  behind 
the  heuristic  algorithms  which  generates  the  vertex-to- 
vertex  correspondence  between  a pair  of  graphs  is  to  use 
various  vertex  invariants  (such  as  degree,  vertex/edge 
label,  adjacency,  etc.)  of  vertices  in  the  two  graphs  to 
generate  pairs  of  vertex  subsets,  which  must  match  if  the 
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graphs  are  to  be  isomorphic.  An  increasing  number  of  vertex 
invariants  are  used  to  partition  vertices  into  smaller  and 
smaller  subsets.  Eventually,  either  every  vertex  in  one 
graph  is  uniquely  paired  off  with  a vertex  in  the  other 
graph,  or  two  subsets  of  vertices  characterized  by  identical 
vertex  invariants  in  the  two  graphs  do  not  have  the  same 
number  of  vertices.  Reduction  of  the  computational 
complexity  is  usually  achieved  by  use  of  these  invariants. 

In  this  chapter,  we  will  propose  a partitioning  scheme 
which  is  an  improvement  over  the  partitioning  scheme  used  in 
the  previous  heuristic  approach.  In  addition  to  the 
partitioning  scheme,  we  will  briefly  describe  the 
architecture  of  the  proposed  algorithm  and  specify  the 
proposed  label  graph  isomorphism  algorithms  for  different 
class  of  graphs. 


2 . 1 Overview  of  the  Algorithm 

The  flow-diagram  of  the  proposed  graph  isomorphism 
algorithm  is  illustrated  in  Figure  2.1.  The  input  is  an 
object  graph  and  a set  of  model  graphs  stored  in  the 
library.  We  want  to  determine  which  model  graph  in  the 
library  is  isomorphic  to  the  object  graph.  If  such  an 

isomorphism  exists,  then  delineate  the  vertex-to-vertex 
correspondence  between  these  two  graphs.  The  proposed  graph 
isomorphism  algorithm  consists  of  two  phases:  the 

isomorphism  test  and  the  vertex  partitioning.  The  graph 
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invariants  containing  integral  information  on  all  vertices 
of  a graph  are  used  as  preliminary  screens  for  the 
isomorphism  test.  The  isomorphism  test  can  be  viewed  as  a 
filter  to  reject  those  non-isomorphic  graphs.  If  a pair  of 
graphs  differ  in  any  one  of  these  graph  invariants,  then 
these  two  graphs  are  declared  as  non-isomorphic  and  the  test 
continues  to  another  pair  of  graphs.  In  the  vertex 
partitioning,  vertex  invariants  are  employed  to  those  graphs 
which  pass  the  isomorphism  test  to  delineate  the  explicit 
vertex-to-vertex  correspondence. 

The  proposed  algorithm  is  most  useful  when  testing 
several  graphs  for  isomorphism.  Namely,  one  has  to  compare 
many  stored  model  graphs  with  an  object  graph  and  find  one 
model  graph  that  is  isomorphic  to  the  object  graph.  In 
this  case,  the  isomorphism  test  serves  as  a screening 
process  to  select  candidate  model  graphs  which  might  be 
isomorphic  to  the  object  graph.  The  vertex  partitioning 
serves  as  an  identification  process  to  ensure  the 
isomorphism  by  determining  a one-to-one  correspondence 
between  the  vertices  of  the  match  model  graph  and  the 
object  graph. 

In  the  construction  of  the  algorithm  certain  ideas  that 
were  expressed  in  Corneil  [1968]  and  Druffel  [1975]  are 
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2.2  Isomorphism  Test 

To  test  whether  two  graphs  and  are  isomorphic,  a 
set  of  graph  invariants  are  employed  as  the  necessary 
conditions  to  test  the  isomorphism  of  the  two  graphs.  A 
graph  invariant  is  a property  or  parameter  that  is  preserved 
by  isomorphism.  Put  another  way,  it  is  a property  which 
does  not  depend  on  the  way  in  which  the  graph  is  labeled. 
For  example,  if  we  consider  the  number  of  vertices  as  the 
graph  invariant,  then  two  graphs  must  have  the  same  number 
of  vertices  if  they  are  to  be  isomorphic. 

The  isomorphism  test  is  especially  useful  when  one  has 
many  graphs  to  test.  In  the  isomorphism  test  the  object 
graph  and  the  model  graph  are  analyzed  in  parallel;  if 
there  is  any  discrepancy  in  the  response  of  the  two  graphs 
to  a graph  invariant,  then  we  conclude  that  these  two  graphs 
are  not  isomorphic. 

Although  the  isomorphism  test  cannot  guarantee  the 
isomorphism,  it  is  nevertheless  of  practical  value.  It  is 
useful  in  two  ways.  First,  it  can  be  used  to  prove  that 
graphs  are  non-isomorphic.  Since  all  graph  invariants  are 
preserved  by  isomorphism,  a pair  of  graphs  differing  in  any 
of  these  graph  invariants  cannot  possibly  be  isomorphic. 
The  second  use  to  which  the  graph  invariants  can  be  put  is 
that  they  can  provide  the  basis  for  a conjecture  that  two 
graphs  are  isomorphic.  It  could  be  conjectured  that  if  we 
are  not  able  to  detect  the  non-isomorphism  by  applying  all 
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graph  invariants  to  a pair  of  graphs  and  , then  graphs 
G^  and  G^  are  isomorphic.  Even  if  a pair  of  graphs  satisfy 
all  graph  invariants,  they  are  not  necessarily  isomorphic, 
but  having  checked  that  they  do  not  differ  in  these  obvious 
ways,  it  is  at  least  less  of  a gamble  to  invest  some  time 
searching  for  an  isomorphism. 

2 . 3 Vertex  Partitioning 

Any  model  graphs  which  do  not  pass  the  isomorphism  test 
are  declared  as  non-isomorphic  to  the  object  graph  and  can 
be  eliminated  immediately.  The  difficulty  with  most  graph 
invariants  is  that  two  graphs  may  have  the  same  graph 
invariants,  yet  not  be  isomorphic.  In  other  words,  those 
model  graphs  which  pass  the  isomorphism  test  are  not 
guaranteed  to  be  isomorphic  to  the  object  graph,  since  graph 
invariants  are  necessary  but  not  sufficient  conditions  for 
graph  isomorphism.  To  ensure  the  isomorphism,  we  have  to 
delineate  the  vertex-to-vertex  correspondence  of  the  two 
graphs.  For  any  vertex  Vj^  in  graph  G^  and  0(vj^)  in  graph 
G^ , if  the  one-to-one  correspondence  between  and  <p(vj^)  in 
the  two  graphs  is  found,  then  graphs  G^  and  G^  are 
guaranteed  to  be  isomorphic.  The  method  to  generate  the 
vertex-to-vertex  correspondence  between  a pair  of  graphs  is 
to  use  various  vertex  invariants  of  vertices  in  the  two 
graphs  to  generate  pairs  of  vertex  subsets,  which  must  match 
if  the  graphs  are  to  be  isomorphic.  An  increasing  number  of 
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vertex  invariants  are  used  to  partition  vertices  into 
smaller  and  smaller  subsets.  Eventually,  either  every 
vertex  in  one  graph  is  uniquely  paired  off  with  a vertex  in 
the  other  graph,  or  two  subsets  of  vertices  characterized  by 
identical  vertex  invariants  in  the  two  graphs  do  not  have 
the  same  number  of  vertices. 

2.3.1  Partitioning  Scheme 

We  now  consider  a special  kind  of  set  splitting,  called 
partitioning.  A partitioning  of  a set  is  a subdivision  of 
that  set  into  subsets  such  that  the  subsets  are  both 
disjoint  and  exhaustive. 

The  partitioning  scheme  that  we  use  in  the  vertex 
partitioning  is  called  the  iterative  set  partitioning.  It 
partitions  the  original  vertex  sets  or  refined  cell  sets 
iteratively  until  no  more  refinement  is  possible,  while  the 
previous  heuristic  approach  used  a much  tedious  set 
reduction  partitioning  scheme  (Sussenguth  [1965],  Figueras 
[1972])  which  is  based  on  set  theory  and  Boolean  Algebra 
rather  than  the  graph-theoretic  approach. 

2. 3. 1.1.  Set  reduction  partitioning 

Set  reduction  partitioning  is  employed  when  two  vertex 
invariants  are  applied  independently  to  the  original  vertex 
sets  of  the  graphs.  The  partitioning  is  required  whenever  a 
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number  of  pairs  of  corresponding  sets  have  been  generated. 
The  purpose  of  the  partitioning  is  to  reduce  the  number  of 
possible  correspondents  of  each  vertex  of  G to  as  few 
vertices  of  G*  as  possible.  This  is  done  by  examining  the 
pairs  of  generated  subsets.  Let  and  S*j[  be  a pair  of 
corresponding  subsets,  that  is  « S*^.  If  vertex  x is  in 
Sj^,  then  its  corresponding  vertex  x*  must  be  in  S*^. 
Therefore,  for  all  those  i for  which  x is  in  Sj^,  x*  is  in 
S*j^;  and,  hence,  x*  is  in  the  intersection  of  those  S*£. 
Symbolically 

^ ""iel  (2-1) 

where  i e I if  x is  in  S . 

If  there  are  pairs  of  corresponding  subsets  Sj  and  S*j 
which  have  the  same  number  of  vertices,  that  is  if  Sj  ~ ^*j' 
then  Sj  « S*j  also  holds.  This  is  true  because  the  set 
equality  implies  a one-to-one  correspondence  between  the 
vertices  of  S j and  S* j , so  that  it  is  impossible  for  a 
vertex  outside  Sj  (that  is,  in  Sj)  to  correspond  to  a vertex 
of  S*j . Therefore  if  vertex  x is  in  Sj , then  its 
correspondent  x*  must  be  in  S*j . In  other  words,  if  Sj  « 
S*j  and  if  vertex  x is  not  in  S j , then  x*  must  be  in  S*j . 

By  using  the  subsets  which  satisfy  these  requirements. 
Equation  2-1  can  be  further  restricted: 
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S*i)  n (n.^j  s*j) 


(2-2) 


where  i e I if  x is  in  Sj^  and  j e J if  Sj  « S*j  and  x is  not 
in  Sj  . 

An  example  may  help  clarify  the  set  reduction 
partitioning  procedure.  Consider  the  following  pairs  of 
corresponding  subsets: 


1^ 

{1,2,4} 

« S*i:  (b,d,f) 

2 = 

(1,3)  » 

S*2-*  (a,d) 

3 • 

(2,4,5) 

« 5*3:  (b,c,f) 

4 * 

(3,4,5) 

« 5*4:  (a,c,f) 

Vertex  1 is  in  subset  and  S2 ; therefore  its  correspondent 
must  be  in  the  intersection  of  S*i  and  S*2.  Vertex  d is  the 
only  vertex  in  this  intersection;  therefore  vertex  1 must 
correspond  to  vertex  d.  Similarly  vertex  2 is  in  S^  and  S3, 
so  its  correspondent  must  be  in  the  intersection  of  S*i  and 
S*3;  moreover,  subset  S4  does  not  include  vertex  2,  so  the 
correspondent  of  vertex  2 must  also  be  in  S*4.  The 
intersection  of  S*^,  8*3,  and  S*4  is  vertex  b;  therefore 
vertex  2 must  correspond  to  vertex  b. 


2. 3. 1.2.  Iterative  set  partitioning 

In  the  iterative  set  partitioning  scheme,  the  subsets  of 
a partitioning  are  called  cells,  and  may  be  formed  by 
grouping  those  elements  of  the  set  which  posses  some  common 
property.  The  collection  of  all  cells  is  called  the  cell 
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set.  In  the  cell  set,  cell  x precedes  cell  y if  the 
elements  in  cell  x posses  the  property  with  smaller  numeric 
or  alphabetic  value  than  that  of  the  elements  in  cell  y. 
Suppose  we  are  given  a vertex  set  V,  and  an  initial  vertex 
cell  set  generated  by  partitioning  the  vertex  set  V into 
disjoint  cells  V12  / * * • » V]^p) , for  some  integer  p > 1. 

Our  task  is  to  find  the  isomorphic  partitioning  with  the 
vertex  cell  set  Vj^,  say,  Vj.  = {Vjr^/ ^k2 ' ’ * ’ »^kq) ' such  that  q 
is  equal  to  the  number  of  elements  in  the  vertex  set. 
Under  the  isomorphic  partitioning,  the  number  of  elements  in 
each  cell  is  equal  to  1,  for  all  i. 

Once  the  vertex  set  has  been  partitioned,  it  is  standard 
practice  to  employ  a refinement  procedure  in  the  hope  of 
producing  a further  refinement  of  the  vertex  set.  The 

purpose  of  the  refinement  is  to  reduce  the  number  of 
possible  mappings  of  each  vertex  of  to  as  few  vertices  of 
as  possible  (ideally  to  a single  vertex)  . This  is  done 
by  examining  the  pairs  of  generated  subsets.  The  refinement 
scheme  we  use  for  finding  the  vertex-to— vertex 
correspondence  between  a pair  of  graphs  is  based  on  the 
iterative  set  partitioning  to  partition  the  set  of  vertices 
iteratively  until  a one-to-one  vertex  correspondence  is 
found.  The  iterative  set  partitioning  consists  of  several 
stages  of  set  partitioning;  those  vertices  with  the  same 
vertex  invariant  are  partitioned  into  the  same  cell  in  each 
stage  of  the  set  partitioning.  It  terminates  either 
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providing  the  isomorphism  by  finding  the  vertex-to-vertex 
correspondence,  or  indicating  that  there  is  no  isomorphism 
for  the  given  pair  of  graphs. 

The  principle  of  the  set  partitioning  is  as  follows;  if 
graph  is  isomorphic  to  graph  , the  vertex  of  G^  which 
exhibit  some  property  must  correspond  to  that  vertex  of  G^ 
which  exhibit  the  same  property.  The  conseguence  of  this 
principle  is  symbolically  represented  as 

(v|v  has  property  p}  = {0(v) |0(v)  has  property  p} 

(2-3) 

For  example,  if  we  consider  the  vertex  degree  as  the 
property,  then  the  vertices  with  degree  4 in  graph  G^  can 
only  correspond  to  those  vertices  with  degree  4 in  graph  G^ . 

Let  us  call  the  vertex  invariant  any  vertex  property 
which  is  invariant  under  graph  isomorphism.  Vertex 
invariants  can  be  usefully  exploited  to  possibly  refine  the 
vertex  set  generated  by  the  previous  vertex  invariant  with 
the  aim  of  leading  to  the  isomorphic  partitioning. 

The  process  of  iterative  set  partitioning  consists  of  a 
finite  number  of  stages.  For  any  specified  vertex  in  G^,  a 
few  candidates  in  G^  were  selected  in  the  first  stage  based 
on  the  comparison  of  the  first  vertex  invariant,  then  a few 
candidates  which  have  passed  the  test  in  the  first  stage  are 
matched  to  the  next  vertex  invariant.  If  the  second  vertex 
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invariant  is  applied  to  initial  cell  sets  and 

either  the  graphs  are  recognized  as  non-isomorphic,  or  a new 
set  of  vertex  cells  V^2  generated.  If  V^2 

and  V^2  t apply  the  third  vertex  invariant  since 

vertex  cells  in  V^2  different  from  those  in 

and  The  vertex  partitioning  process  can  be  carried  on 

until  (1)  graphs  are  found  to  be  non-isomorphic,  (2)  one-to- 
one  vertex  correspondence  has  been  found,  or  (3)  for  some  m 
> 1,  and  If  the  third  condition 

holds,  the  vertex  cell  sets  V^jj,  and  are  called  the 

terminal  cell  sets.  In  other  words,  a vertex  cell  set  is 
terminal  if  the  number  of  cells  in  the  cell  set  is  equal  to 
the  number  of  cells  in  the  cell  set  of  its  previous 
partitioning.  In  the  terminal  cell  sets  situation,  all 
pairs  of  vertices  belonging  respectively  to  and  V^jjji 

are  "equivalent"  (have  the  same  attributes)  with  respect  to 
all  vertex  invariants  used  in  deriving  and  V^j^. 

2. 3. 1.3  Comparison  between  set  reduction  partitioning  and 
iterative  set  partitioning 

To  make  a comparison  between  the  partitioning  schemes 
used  in  the  previous  heuristic  approach  and  our  approach, 
let  us  consider  the  problems  of  determining  whether  or  not 
the  two  chemical  molecules  in  Figure  2.2  are  isomorphic. 

The  process  of  set  reduction  partitioning  used  in  the 
previous  heuristic  approach  for  generating  matching  subsets 
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G 


2 


Figure  2.2  Vertex  Partitioning  Comparison  Between  Previous 
Heuristic  Approach  and  Our  Approach. 
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Table  2.1  Set  Reduction  Partitioning. 


Features 

Corresponding 

Subsets  in 
g2 

Subset 

Number 

Vertex  Label 

C 

1,2, 4, 7 

b,e, f ,h 

1 

0 

3,6,8 

a,c,g 

2 

N 

5 

d 

3 

Degree 

One 

1,3, 5, 6 

a,d,g,h 

4 

Two 

8 

c 

5 

Three 

2,4 

e,f 

6 

Four 

7 

b 

7 

Intersection  of 

Subsets  1 and  4 

1 

h 

8 

Partitioning 

1 

h 

2,4 

e,f 

7 

b 

3,6 

a,g 

8 

c 

5 

d 

Table  2.2  Iterative  Set  Partitioning. 


Features 

Generated  Cell 
G^ 

Sets  of 
g2 

Vertex  Set 

{1,2, 3, 4, 5, 6, 7, 8} 

{a,b,c,d,e,f,g,h} 

Vertex  Label 

CCOCNOCO 

OCONCCOC 

Partitioning 

{ (1,2, 4, 7) ,5, (3,6,8) ) 

{ (b,e,f,h) ,d, (a,c,g) } 

Degree 

1334  1 112 

4331  1 121 

Partitioning 

(1,  (2,4)  ,7,5,  (3,6) ,8} 

(h, (e,f) ,b,d, (a,g) ,c} 
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with  common  vertex  invariants  is  outlined  in  Table  2.1. 
Shown  in  Table  2.2  is  the  process  of  iterative  set 
partitioning  utilized  in  our  proposed  vertex  partitioning 
scheme . 

Notice  that  the  iterative  set  partitioning  technique 
needs  fewer  steps  and  less  complexity  in  generating  the  same 
partitioning  result  compared  to  the  set  reduction 
partitioning  technique  used  in  the  previous  heuristic 
approach. 

2.3.2  Cardinality  Violation 

Let  fk(n)  be  the  value  of  the  vertex  invariant  of 

vertex  n.  Applying  the  vertex  invariant  to  a graph  G = 

(V,E)  causes  a partitioning  of  V into  r equivalent  cells  V)^ 
~ ^^kl' * * ’ / ^kr) ' r > 1,  such  that  two  vertices  Vj^  and  Vj 
belong  to  the  same  cell,  if  and  only  if  fj^Cv^)  = f]^(vj).  To 
illustrate  this,  consider  a graph  G with  six  vertices  v^, 

^F*  Assume  we  use  the  vertex  degree  as 
the  first  vertex  invariant  f^  in  the  vertex  partitioning. 
Assume  the  degrees  of  vertices  in  graph  G are  fi(v;^)  = 2, 

fl(VB)  = 3,  fi(vc)  = 4,  fi(VD)  = 2,  fiCv^)  = 3,  and  fi(Vp)  = 
2.  The  vertex  partitioning  generated  by  f^^  is  = 

{Vii,Vi2,Vi3) , where  = (vj^,VD,Vp)  , V12  = (vb,Ve)  , and 

^13  = (vq)  . Vertices  vj^,  Vp,  and  Vp  are  partitioned  into 
the  same  cell  because  they  all  have  degree  2. 
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Given  two  graphs  and  G^,  let  , • • • , } 

and  * * * / be  the  sets  of  equivalent  cells 

generated  by  the  vertex  invariant  on  sets  of  vertices 
and  of  G^  and  G^  respectively.  Vertices  belonging  to 
and  to  are  equivalent  as  far  as  the  vertex 

invariant  is  concerned.  A vertex  belonging  to  can 

possibly  be  mapped  only  onto  vertices  belonging  to  by 

an  isomorphism  of  G^  onto  G^ . 

A trivial,  but  very  important,  property  of  the  vertex 
partitioning  is  that  corresponding  cells  in  the  two  graphs 
must  have  the  same  number  of  vertices.  To  show  the 
constraint  imposed  by  this  property,  let  us  define  a cell 
count  vector  Z such  that  the  element  is  the  number  of 
vertices  in  cell  i.  It  follows  that  if  G^  is  isomorphic  to 

G^,  then  z^j,  = z^j^,  for  all  i.  If  the  number  of  vertices  of 

graph  G^  which  exhibit  some  property  is  different  from  the 
number  of  vertices  of  graph  G^  which  exhibit  the  same 

property,  then  G^  cannot  be  isomorphic  to  G^ . it  could  be 
concluded  that  graphs  G^  and  G^  were  not  isomorphic  if 
corresponding  cells  and  did  not  have  the  same 

number  of  vertices.  Therefore,  pairs  of  graphs  which  do  not 
have  identical  cell  count  vectors  may  be  rejected 
immediately.  Such  an  occurrence  is  called  a cardinality 
violation.  To  illustrate  the  cardinality  violation,  let  us 
consider  two  graphs  G^  and  G^  with  the  vertex  sets  = 


{Va/Vb/Vc,Vd,Ve,Vf } . Suppose 
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the  k^h  vertex  cell  sets  v\  = { (v^,vd,vf)  , (vb,Ve)  , (vq)  } and 
^ k ~ ( (’^d/’^'^e)  ' generated  by  applying 

the  k'*^^  vertex  invariant  to  the  (k-l)^^  vertex  cell  sets, 
where  = (va,Vd,Vf)  , = (Vb/Ve)  , = (Vq)  , = 

(Vd/Ve)  / V^]^2  = (Va/Vf)  , and  = (vjjjVj^)  . The  cell  count 

vectors  of  these  two  graphs  are  = (3,2,1)  and  = 

(2,2,2).  Since  f and  z^3  / 2^3,  the  cardinality 

violation  occurs  and  it  is  concluded  that  graphs  and 
are  concluded  to  be  non-isomorphic. 

2.3.3  Correspondence  Matrix 

The  utilization  of  vertex  invariant  may  reduce  the 
number  of  possible  mappings  of  each  vertex  of  G^  to  as  few 
vertices  of  G^  as  possible.  To  represent  the  possible 
vertex  mappings  generated  by  the  vertex  invariant,  let  us 
define  the  correspondence  matrix,  P,  as  an  n x n binary 
matrix  such  that  pj^j  = 1 if  the  existence  of  an 
isomorphism,  mapping  vertex  i of  G^  onto  vertex  j of  G^  is 
possible;  pj^j  = 0 if  it  is  impossible  for  such  an 

isomorphism  to  exist.  Let  I denote  the  n x n matrix  every 
element  of  which  is  1.  Initially,  P = I (i.e.,  for  any 

vertex  i e G^,  there  could  be  an  isomorphism  mapping  i onto 

any  vertex  j e G^) . After  the  first  vertex  invariant  has 
been  imposed  and  the  rows  and  columns  of  P have  been 

suitably  reordered,  P is  of  the  block  diagonal  form  as 
illustrated  in  Figure  2.3.  Associated  with  this  matrix,  the 
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maximum  number  of  vertex  reorderings  to  be  performed  by  the 
vertex  reordering  is  = 7t._  where  i is  the  number  of 
cells  in  the  first  cell  set.  Initially  Yq  = nl  . When  we 
apply  the  next  vertex  invariant,  a new  correspondence  matrix 
is  obtained.  This  new  correspondence  matrix  represents  the 
restrictions  imposed  on  the  possible  vertex  pairings  by  both 
the  first  and  second  vertex  invariants.  The  new  maximum 
number  of  vertex  reorderings  is  Y2.  The  process  of  imposing 
new  vertex  invariants  continues  until  the  unique  vertex-to- 
vertex  correspondence  has  been  found.  In  that  case,  the 
correspondence  matrix  is  a diagonal  matrix  as  shown  in 
Figure  2.4. 


2 . 4 Some  Commonly  Used  Invariants 

A list  of  some  commonly  used  invariants  which  will  be 
used  in  this  dissertation  are  briefly  described  in  this 
section.  These  invariants  are  employed  either  in  the 
isomorphism  test  or  vertex  partitioning  to  reduce  the  number 
of  possible  graph  or  vertex  mappings. 

2.4.1  Number  of  Vertices 

The  mapping  0(G^)  = is  an  isomorphism  only  if  the 

number  of  vertices  in  the  two  graphs  is  the  same. 

Since  two  graphs  may  be  isomorphic  only  if  they  have  the 
same  order,  we  may  consider  two  given  graphs  to  have  the 
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G^:  (1,5) , (3,8) , (2,6,7) ,4 

g2:  (2,6) , (7,8) , (1,4,5) ,3 


1 5 3 8 2 


6 7 4 


2 

6 

7 

8 
1 

4 

5 
3 


1 

1 

0 

0 

0 

0 

0 

0 
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1 

0 

0 

0 

0 

0 

0 


0 

0 

1 

1 

0 

0 

0 

0 


0 

0 

1 

1 

0 

0 

0 

0 


0 

0 

0 

0 

1 

1 

1 

0 


0 

0 

0 

0 

1 

1 

1 

0 


0 
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0 

0 

1 

1 

1 

0 


0 

0 

0 

0 

0 

0 

0 

1 


Figure  2.3  Correspondence  Matrix  Illustrating  the  Vertex 
Partitioning  Result  between  Graphs  G^  and  G^ . 


g}:  1,5, 8, 3, 2, 6, 7, 4 

G^:  6, 2, 7, 8, 4, 5, 1,3 


1 5 8 3 2 


6 7 4 
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8 

4 

5 
1 
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0 

0 

0 

0 
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0 
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0 

0 

0 

0 

0 
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0 

0 

1 

0 

0 

0 

0 

0 


0 

0 

0 

1 

0 

0 

0 

0 


0 

0 

0 

0 

1 

0 

0 

0 


0 

0 

0 
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0 
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0 

0 

0 

0 

0 

0 

1 
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0 

0 

0 

0 

0 

0 
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Figure  2.4  Correspondence  Matrix  Illustrating  the 
Isomorphic  Partitioning  between  Graphs 
G^  and  g2. 
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same  vertex  set.  An  isomorphism  is  a one-to-one 
correspondence  between  vertex  sets  and  vertex  sets  are 
finite,  so  isomorphic  graphs  must  have  the  same  number  of 
vertices.  If  and  are  two  graphs,  then  a one-to-one 
correspondence  between  G^  and  G^  is  an  association  of 
vertices  of  G^  with  vertices  of  G^  in  such  a way  that  (1)  to 
each  vertex  of  G^  there  has  been  associated  a single  vertex 
of  G^ , and  (2)  to  each  vertex  of  G^  there  has  been 
associated  a single  vertex  of  G^.  Thus  the  presence  of  a 
one-to-one  correspondence  between  two  graphs  means  that  they 
at  least  have  the  same  number  of  vertices. 

2.4.2  Number  of  Edges 

The  mapping  0(G^)  = G^  is  an  isomorphism  only  if  the 
number  of  edges  in  the  two  graphs  is  the  same. 

An  isomorphism  induces  a one-to-one  correspondence 
between  edge  sets  and  since  edge  sets  are  finite  also 
(because  the  vertex  sets  are  finite) , so  isomorphic  graphs 
must  have  the  same  number  of  edges. 

2.4.3  Vertex  Label 

In  isomorphic  graphs  corresponding  vertices  must  have 
the  same  vertex  label,  because  isomorphism  preserves  vertex 
labels.  The  mapping  0 with  0(v^j^)  = v^j  is  an  isomorphism 
only  if  the  vertex  labels  in  the  two  vertices  are  the  same. 
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Assume  vertex  label  is  applied  to  the  graphs  in  Figure  2.5. 
It  is  first  observed  that  those  vertices  of  with  vertex 
label  L3  must  correspond  to  those  vertices  of  with  vertex 
label  L3 . That  is,  vertex  B must  correspond  to  either 
vertex  b or  vertex  c and  no  others,  and  vertex  E must 
correspond  to  either  vertex  b or  c. 

2.4.4  Vertex  Label  Distribution 

No  isomorphism  can  exist  between  two  graphs,  G^=(V^,E^) 
and  g2=(v2,e2),  if  the  number  of  vertices  of  G^  with  vertex 
label  L]^,  is  not  equal  to  the  number  of  vertices  of  G^  with 
vertex  label  Lj^.  The  isomorphism  of  G^  onto  G^  may  map 
vertex  v of  G^  onto  vertices  Vj^  of  G^  only  if  the  vertex 
label  of  V in  G^  is  equal  to  the  vertex  label  of  Vj^  in  G^ , 
for  all  i.  Thus,  G^  and  G^  are  isomorphic  only  if  nj^ 
vertices  of  both  graphs  have  vertex  label  Lj^,  for  all  i.  If 
the  number  of  vertices  in  G^  with  vertex  label  Lj^  is  not 
equal  to  the  number  of  vertices  in  G^  with  vertex  label  Lj^, 
for  all  i,  then  we  conclude  that  the  two  graphs  are  not 
isomorphic.  To  illustrate  this,  let  us  consider  the  graphs 
in  Figure  2.6.  Graph  G^  has  four  vertices  with  vertex  label 
L3  and  two  vertices  with  vertex  label  L3 , whereas  graph  G2 
has  two  vertices  with  vertex  label  L3  and  four  vertices  with 
vertex  label  L3 . Since  the  vertex  label  distributions  of 
these  two  graphs  are  different,  we  conclude  that  graphs  G^^ 
and  G2  are  not  isomorphic. 
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Vertex 

Vertex  Label 

A 

Ll 
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L3 

C 

Li 
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Ll 
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■ L3 
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Ll 

Vertex 

Vertex  Label 

a 

Li 

b 

L3 

c 

L3 

d 

Li 

e 

Ll 

f 

Li 

Figure  2.5  An  Example  Illustrating  That  Vertex  Label  Can  Be 
Used  as  an  Invariant  in  the  Vertex  Partitioning. 
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G' 


Vertex 

Vertex  Label 
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Ll 

B 

L3 
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L3 
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Vertex 

Vertex  Label 
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Li 
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L2 
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L2 
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L2 

e 

Ll 
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L2 

Figure  2.6  An  Example  Illustrating  That  Vertex  Label 

Distribution  Can  Be  Used  as  an  Invariant  in 
the  Isomorphism  Test. 
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2.4.5  Edge  Label  Distribution 

No  isomorphism  can  exist  between  two  graphs,  G^=(V^,E^) 
and  g2=(v2,e2),  if  the  number  of  edges  of  G^  with  edge  label 
1]^,  is  not  egual  to  the  number  of  vertices  of  G^  with  edge 
label  1]^.  Thus,  G^  and  G^  are  isomorphic  only  if  nj^  edges 
of  both  graphs  have  edge  label  Ij^,  for  all  i.  If  the 

number  of  edges  in  G^  with  edge  label  Ij^  is  not  equal  to 

the  number  of  edges  in  G^  with  edge  label  1^,  for  all  i, 
then  we  conclude  that  the  two  graphs  are  not  isomorphic. 
I'o  illustrate  this,  let  us  consider  the  graphs  in  Figure 
2.7.  Graph  G^  has  two  edges  with  edge  label  1^,  two  edges 
with  edge  label  I2,  and  four  edges  with  edge  label  I3, 

whereas  graph  G2  has  one  edge  with  edge  label  1^,  two  edges 
with  edge  label  I2,  and  five  edges  with  edge  label  I3. 

Since  the  edge  label  distributions  of  these  two  graphs  are 
different,  we  conclude  that  graphs  G^  and  G2  are  not 
isomorphic. 

2.4.6  Edge  Adjacency  Vector 

Let  us  first  state  one  condition  that  must  be  satisfied 
by  two  vertices  v^^  e and  v^^  e , if  there  exists  an 
isomorphism  of  G^  onto  G^  mapping  v^^  onto  A vertex 

v^u  belonging  to  can  be  mapped  onto  a vertex 

belonging  to  by  an  isomorphism  of  G^  onto  G^  only  if  for 
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G' 


r'Z 


Figure  2.7  An  Example  Illustrating  That  Edge  Label 
Distribution  Can  Be  Used  as  an  Invariant 
in  the  Isomorphism  Test. 
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each  edge  containing  v^u  (i.e.  having  as  initial  or 
final  vertex)  there  exists  an  equivalent  edge  containing 
V This  condition  is  immediately  derived  from  the 
definition  of  graph  isomorphism.  In  other  words,  a vertex 
v^^  belonging  to  can  be  mapped  onto  a vertex  v^^ 
belonging  to  by  an  isomorphism  of  onto  Qp- , only  if  for 
each  edge  label  1]^,  the  number  of  edges  belonging  to  Ij^  and 
containing  is  equal  to  the  number  of  edges  belonging  to 
1]^  and  containing  v^^. 

If  vertex  v^  has  m linked  edges,  e^i,  the 
edge  adjacency  vector  of  vertex  vj^  can  be  formed  by  grouping 
its  edge  label  set  l(ej^3^),  l(ei2)^  **•/  l(®im)  then 
arranging  the  edge  label  set  in  ascending  numeric  or 
alphabetic  order,  where  1 (®ij ) denotes  the  edge  label  of  the 
jth  linked  edge  eij . For  instance,  consider  graph  in 
Figure  2.8.  There  are  four  edges  linking  with  vertex  B. 
They  are  egj^,  eg^f  ®BF  with  edge  labels  I2,  li,  I3, 
and  I3  respectively.  The  edge  adjacency  vector  of  vertex  B 
is  then  obtained  by  grouping  and  reordering  its  linking  edge 
label  set,  which  is  (11,12,13,13). 

Assume  edge  adjacency  vector  is  applied  to  the  graphs  in 
Figure  2.8.  It  is  observed  that  those  vertices  of  G^  with 
edge  adjacency  vector  (12,13)  must  correspond  to  those 
vertices  of  G^  with  edge  adjacency  vector  (12,13).  That 
is,  vertex  A must  correspond  to  either  vertex  e or  vertex  f 
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G' 


Figure  2.8  An  Example  Illustrating  That  Edge  Adjacency 
Vector  Can  Be  Used  as  an  Invariant  in  the 
Vertex  Partitioning. 
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and  no  others,  and  vertex  C must  correspond  to  either  vertex 
e or  f. 


2.5  The  Label  Graph  Isomorphism  Algorithm 

In  addition  to  the  invariants  discussed  in  section  2.4, 
we  propose  some  feature-oriented  invariants  which  will  be 
discussed  in  later  chapters.  Two  kinds  of  invariants  are 
proposed  in  this  dissertation:  the  graph  invariant  and  the 
vertex  invariant,  the  former  is  employed  in  the  isomorphism 
test  to  eliminate  those  non-isomorphic  graphs  at  early  steps 
the  latter  is  employed  in  the  vertex  partitioning  to 
delineate  the  one-to-one  vertex  correspondence  to  ensure  the 
isomorphism. 

The  graph  invariants  that  we  propose  are  the  canonical 
code  (Chapter  4) , the  canonical  string  (Chapter  4) , and  the 
connectivity  code  (Chapter  5) . 

The  vertex  invariants  that  we  propose  are  the  simple 
cycle  invariants  (Chapter  3) , the  cycle  invariants  (Chapter 
3),  the  connectivity  vector  (Chapter  5),  and  the  distance 
list  (Chapter  5) . 

The  proposed  algorithm  is  effective  for  a large  class  of 
label  graphs.  Two  classes  of  label  graphs  are  considered  in 
this  dissertation.  They  are  (1)  the  label  graphs  under 
rotation,  translation,  or  scale  changes,  and  (2)  the  label 
graphs  under  isomorphism. 
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2.5.1  Graph  Isomorphism  Algorithm  for  the  Graphs  Under 
Pictorial  Rotation.  Reflection,  and  Shear 

The  graph  isomorphism  algorithm  given  below  is  for  the 
graph  diagram  under  pictorial  rotation,  reflection,  and 
shear. 

Algorithm  2-1 

Step  1:  Compare  the  number  of  vertices  of  the  model  graph 
with  that  of  the  object  graph.  If  the  number  of  vertices  of 
these  two  graphs  are  identical,  go  to  step  2.  Otherwise, 
continue  the  algorithm  with  another  model  graph  and  go  to 
step  1 . 

Step  2 : Compare  the  number  of  edges  of  the  model  graph  with 
that  of  the  object  graph.  If  the  number  of  edges  of  these 
two  graphs  are  identical,  go  to  step  3.  Otherwise, 
continue  the  algorithm  with  another  model  graph  and  go  to 
step  1 . 

Step  3:  Compare  the  connectivity  code  of  the  model  graph 
with  that  of  the  object  graph.  If  the  connectivity  codes  of 
these  two  graphs  are  identical,  go  to  step  4.  Otherwise, 
continue  the  algorithm  with  another  model  graph  and  go  to 
step  1 . 

Step  4:  Compare  the  canonical  code  of  the  model  graph  with 
that  of  the  object  graph.  If  the  canonical  codes  of  these 
two  graphs  are  identical,  then  the  two  graphs  are  isomorphic 
and  every  vertex  in  the  model  graph  is  uniquely  paired  off 
with  a vertex  in  the  object  graph  according  to  their  order 
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in  the  final  canonical  orderings.  Terminate  the  algorithm. 
Otherwise,  go  to  step  5. 

Step  5:  Compare  the  canonical  string  of  the  model  graph  with 
that  of  the  object  graph.  If  the  canonical  strings  of  these 
two  graphs  are  identical,  go  to  step  6.  Otherwise,  continue 
the  algorithm  with  another  model  graph  and  go  to  step  1. 

Step  6:  Partition  the  vertex  sets  of  the  model  graph  and  the 
object  graph  by  applying  the  vertex  label  and/or  edge 
adjacency  vector.  If  the  cardinality  violation  occurs 
during  the  vertex  partitioning,  then  these  two  graphs  are 
non-isomorphic.  Continue  the  algorithm  with  another  model 
graph  and  go  to  step  1.  Otherwise,  check  if  the 
correspondence  matrix  is  a diagonal  matrix.  If  so,  the 
one-to-one  vertex  correspondence  is  found.  Terminate  the 
algorithm.  Otherwise,  go  to  step  7. 

Step  7:  Partition  the  vertex  cell  sets  of  the  model  graph 
and  the  object  graph  generated  in  step  6 by  applying  the 
simple  cycle  invariants  if  the  number  of  simple  cycles  is 
less  than  the  preset  threshold.  If  the  cardinality 
violation  occurs  during  the  vertex  partitioning,  then  these 
two  graphs  are  non-isomorphic.  Continue  the  algorithm  with 
another  model  graph  and  go  to  step  1.  Otherwise,  check  if 
the  correspondence  matrix  is  a diagonal  matrix.  If  so,  the 
one-to-one  vertex  correspondence  is  found.  Terminate  the 
algorithm.  Otherwise,  go  to  step  8. 
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Step  8:  Partition  the  vertex  cell  sets  of  the  model  graph 
and  the  object  graph  generated  in  step  7 by  applying  the 
connectivity  vector.  If  the  cardinality  violation  occurs 
during  the  vertex  partitioning,  then  these  two  graphs  are 
non-isomorphic.  Continue  the  algorithm  with  another  model 
graph  and  go  to  step  1.  Otherwise,  check  if  the 
correspondence  matrix  is  a diagonal  matrix.  If  so,  the  one- 
to-one  vertex  correspondence  is  found.  Terminate  the 
algorithm.  Otherwise,  go  to  step  9. 

Step  9:  Partition  the  vertex  cell  sets  of  the  model  graph 
and  the  object  graph  generated  in  step  8 iteratively  by 
applying  the  relaxation  partitioning.  The  algorithm  will 
always  terminate  because  the  relaxation  partitioning  using 
the  distance  list  is  guaranteed  to  lead  to  the  generation  of 
isomorphic  partitioning.  If  the  cardinality  violation 
occurs  during  the  relaxation  partitioning,  then  these  two 
graphs  are  non-isomorphic.  Continue  the  algorithm  with 
another  model  graph  and  go  to  step  1. 


2.5.2  General  Graph  Isomorphism  Algorithm 

The  general  graph  isomorphism  algorithm  for  can  be 
stated  as  follows: 

Algorithm  2-2 

Step  1:  Compare  the  number  of  vertices  of  the  model  graph 

with  that  of  the  object  graph.  If  the  number  of  vertices  of 
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these  two  graphs  are  identical,  go  to  step  2.  Otherwise, 
continue  the  algorithm  with  another  model  graph  and  go  to 
step  1. 

Step  2 : Compare  the  number  of  edges  of  the  model  graph  with 
that  of  the  object  graph.  If  the  number  of  edges  of  these 
two  graphs  are  identical,  go  to  step  3.  Otherwise, 
continue  the  algorithm  with  another  model  graph  and  go  to 
step  1. 

Step  3:  Compare  the  connectivity  code  of  the  model  graph 
with  that  of  the  object  graph.  If  the  connectivity  codes  of 
these  two  graphs  are  identical,  go  to  step  4.  Otherwise, 
continue  the  algorithm  with  another  model  graph  and  go  to 
step  1 . 

Step  4:  Compare  the  canonical  code  of  the  model  graph  with 
that  of  the  object  graph.  If  the  canonical  codes  of  these 
two  graphs  are  identical,  then  the  two  graphs  are  isomorphic 
and  every  vertex  in  the  model  graph  is  uniguely  paired  off 
with  a vertex  in  the  object  graph  according  to  their  order 
in  the  final  canonical  orderings.  Terminate  the  algorithm. 
Otherwise,  go  to  step  5. 

Step  5:  Compare  the  canonical  string  of  the  model  graph  with 
that  of  the  object  graph.  If  the  canonical  strings  of  these 
two  graphs  are  identical,  go  to  step  6.  Otherwise,  continue 
the  algorithm  with  another  model  graph  and  go  to  step  1. 

Step  6:  Partition  the  vertex  sets  of  the  model  graph  and  the 
object  graph  by  applying  the  vertex  label  and/or  edge 
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adjacency  vector.  If  the  cardinality  violation  occurs 
during  the  vertex  partitioning,  then  these  two  graphs  are 
non-isomorphic.  Continue  the  algorithm  with  another  model 
graph  and  go  to  step  1.  Otherwise,  check  if  the 
correspondence  matrix  is  a diagonal  matrix.  If  so,  the 

one-to-one  vertex  correspondence  is  found.  Terminate  the 
algorithm.  Otherwise,  go  to  step  7. 

Step  7:  Partition  the  vertex  cell  sets  of  the  model  graph 

and  the  object  graph  generated  in  step  6 by  applying  the 

cycle  invariants  if  the  number  of  simple  cycles  is  less  than 
the  preset  threshold.  If  the  cardinality  violation  occurs 
during  the  vertex  partitioning,  then  these  two  graphs  are 
non-isomorphic.  Continue  the  algorithm  with  another  model 
graph  and  go  to  step  1.  Otherwise,  check  if  the 

correspondence  matrix  is  a diagonal  matrix.  If  so,  the 

one-to-one  vertex  correspondence  is  found.  Terminate  the 
algorithm.  Otherwise,  go  to  step  8. 

Step  8:  Partition  the  vertex  cell  sets  of  the  model  graph 

and  the  object  graph  generated  in  step  7 by  applying  the 

connectivity  vector.  If  the  cardinality  violation  occurs 
during  the  vertex  partitioning,  then  these  two  graphs  are 
non-isomorphic.  Continue  the  algorithm  with  another  model 
graph  and  go  to  step  1.  Otherwise,  check  if  the 

correspondence  matrix  is  a diagonal  matrix.  If  so,  the  one- 
to-one  vertex  correspondence  is  found.  Terminate  the 

algorithm.  Otherwise,  go  to  step  9. 
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9:  Partition  the  vertex  cell  sets  of  the  model  graph 
and  the  object  graph  generated  in  step  8 iteratively  by 
applying  the  relaxation  partitioning.  The  algorithm  will 
always  terminate  because  the  relaxation  partitioning  using 
the  distance  list  is  guaranteed  to  lead  to  the  generation  of 
isomorphic  partitioning.  If  the  cardinality  violation 
occurs  during  the  relaxation  partitioning,  then  these  two 
graphs  are  non— isomorphic.  Continue  the  algorithm  with 
another  model  graph  and  go  to  step  1. 

2.6  Notations  and  Definitions 

Since  the  algorithms  developed  in  this  dissertation  are 
graph  oriented,  a number  of  graph  related  terms  will  be 
used.  In  this  section,  basic  notations  and  definitions 
consistent  with  those  used  in  the  study  of  graph  theory  will 
be  presented.  These  notations  and  definitions  are  more  or 
less  standard  in  the  literature  (Harry  [1969],  Deo  [1974], 
and  Tutte  [1984]). 

GRAPH:  A graph  G is  an  ordered  pair  G=(V,E)  where  V is  a 
finite,  nonempty  set  and  E is  a finite  set  of  unordered 
pairs  (vi,vj)  representing  a relation  on  V. 

VERTEX:  Members  of  the  set  V are  called  vertices.  If  there 
are  n members  in  the  set  V,  then  the  order  or  cardinality  of 
V = n. 

EDGE:  Members  of  the  set  E are  called  edges.  The  relation 
"(vi,Vj)  is  an  edge  of  G"  is  denoted  by  v^  > Vj . 
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MXJACENCY;  Two  vertices,  and  Vj  are  adjacent  if  (vi,Vj)  e 
E.  In  other  words,  if  vertices  v^  and  Vj  are  endpoints  of 
one  edge,  then  Vj^  and  Vj  are  said  to  be  adjacent  to  each 
other. 

ADJACENCY — MATRIX,  A,  OF  G(V,E)  ; A binary  n x n matrix.  A, 
such  that  aj^j  = 1 if  edge  (vj^,Vj)  e E;  aj^j  = 0 otherwise  (n 
is  the  order  of  G)  . A connectivity  between  two  vertices,  Vj^ 
and  Vj , is  indicated  by  a ' 1 ' at  the  intersection  of  their 
columns  and  rows,  i.e.,  the  matrix  elements  aj^j  and  ajj^  are 
each  set  to  1.  Matrix  elements  representing  no  connectivity 
are  zero. 

The  adjacency  matrix  of  every  undirected  graph  is 
symmetric  about  the  main  diagonal  since  if  vertices  Vj^  and 
Vj  are  adjacent,  then  aj^j  = ajj^  = 1 in  the  adjacency  matrix. 
DEGREE:  The  degree  d(v)  of  a vertex  v is  the  number  of  edges 
incident  with  v. 

We  can  easily  find  the  degree  of  a vertex  by  looking  at 
the  adjacency  matrix  associated  with  the  graph.  The  sum  of 
the  entries  in  row  i is  just  d(vj^)  . 

EDGE  NUMBER:  The  edge  number  of  a graph  is  the  number  of 
edges  in  the  graph. 

We  can  easily  find  the  edge  number  of  a graph  by  looking 
at  the  adjacency  matrix  associated  with  the  graph.  The  sum 
of  the  entries  in  row  i is  d(vj^)  . The  sum  of  the  degrees  is 
then  S-j^d(vj^),  which  equals  the  sum  of  the  entries  over  all 
rows.  Since  the  graph  is  undirected,  each  edge  (vi,vj)  is 
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represented  twice  in  this  adjacency  matrix;  one  for  Vj^  and 
one  for  Vj . Thus,  in  any  undirected  graph,  the  number  of 
edges  equals  half  the  sum  of  the  degrees. 

PATH;  The  relation  "There  is  a path  from  to  v j " is 

denoted  by  p:  > vj . A path  p:  > vj  in  G is  a 

sequence  of  vertices  and  edges  leading  from  to  Vj . 

SIMPLE — PATH : A path  is  simple  if  all  its  vertices  are 

distinct.  A simple  path  p:  vj^ > vj  is  an  alternating 

sequence  of  vertices  and  edges  of  G leading  from  v^  to  v j , 
such  that  every  edge  is  immediately  preceded  and  succeeded 
by  the  two  vertices  with  which  it  is  adjacent  and  no  vertex 
occurs  more  than  once. 

PATH — LENGTH:  The  length  of  a path  is  defined  as  the  number 
of  edges  in  the  path. 

CYCLE ; A path  p:  vj^ > vj^  is  called  a cycle;  its  starting 

vertex  and  terminating  vertex  are  the  same.  According  to 

this  definition,  a path  p:  v^  > v^  is  a cycle  if  all  its 

vertices  and  edges  are  distinct  and  the  only  vertex  to  occur 
twice  in  p is  Vj^,  which  occurs  exactly  twice. 

Two  cycles  which  are  cyclic  permutations  of  each  other 
are  considered  to  be  the  same  cycle. 

S IMPLE  C Y CLE ; A path  p:  V£ > Vj^  is  called  a simple  cycle 

if  its  area  is  entirely  surrounded  by  edges  of  the  graph 
without  containing  any  edge. 

CONNECTED:  A graph  is  connected  if  every  pair  of  distinct 
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vertices  (vj^,vj)  are  joined  by  at  least  one  path  p:  Vj^ > 

^j* 

In  making  a formal  definition  it  is  easier  to  consider 
the  opposite  property;  a graph  is  disconnected  if  its 
vertex-set  can  be  partitioned  into  two  subsets,  and  V2 , 
which  have  no  common  elements,  in  such  a way  that  the  graph 
contains  no  edge  with  one  endpoint  in  V^,  and  the  other  in 
V2;  if  a graph  is  not  disconnected  then  it  is  connected. 
REACHABLE ; Let  u and  w be  any  two  vertices  of  a graph  G.  We 
say  that  vertex  w is  reachable  from  vertex  u if  (1)  u is 
adjacent  to  w,  or  (2)  there  is  a finite  seguence  of  edges 
(u,V3^)  , (v^,V2)  , •••  , (vj^_2,vj^)  , (vj^,w)  , all  contained  in 

G.  In  other  words,  vertex  w is  reachable  from  vertex  u if 
there  exists  a path  from  vertex  u to  vertex  w.  If  the  graph 
is  connected,  then  every  vertex  is  reachable  from  every 
other  vertex. 

DISTANCE:  Let  G = (V,E)  be  a connected  graph.  There  may  be 
several  different  paths  from  a given  vertex  v^  to  another 
vsrtex  Vj , and  these  will  have  different  lengths.  We  define 
the  length  of  a path  as  the  number  of  edges  in  the  path. 
Let  us  write  d^j  for  the  shortest  of  all  these  lengths,  and 

it  the  distance  from  Vj^  to  Vj  . The  distance  from  Vj^  to 
' *^ij ' can  be  found  by  looking  at  all  the  possible  paths 
from  Vj^  to  Vj  . 


dij  = min  p(vi,vj) 


(2-4) 
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where  p(vj^,Vj)  ranges  over  all  paths  from  vertex  Vj^  to 
vertex  Vj  and  denotes  the  sum  over  all  edges  in  the  path 
from  Vj^  to  Vj  . 

NEIGHBORHOOD : A neighborhood  of  a vertex  Vj^  is  that  set  N of 
vertices  Vj  such  that  Vj  e N if  and  only  if  (vi,vj)  e E. 
TREE;  A tree  is  a connected  graph  with  no  cycles.  A rooted 
tree  is  a tree  with  the  root  at  the  top,  vertices  adjacent 
to  the  root  are  drawn  in  a horizontal  line  below  the  root. 
The  root  is  said  to  be  at  level  zero,  and  those  vertices 
adjacent  to  the  root  are  at  level  one.  Edges  are  called 

branches  and  vertices  are  called  nodes.  Nodes  at  level  i 

are  successors  of  the  node  at  level  i-l  to  which  they  are 
adjacent.  A node  whose  degree  is  one  is  called  a leaf. 


CHAPTER  III 

INVARIANT  GENERATION  BASED  ON  CYCLE  ANALYSIS 

One  of  the  most  important  features  of  a graph  is  the 
nature  of  its  cycle  structure.  The  cycle  structure  of  a 
graph  often  gives  significant  clues  to  the  cyclic 
information  of  the  graph.  Consequently,  there  is  a strong 
need  for  methods  of  analyzing  the  cycle  structure  of  a 
graph.  Most  of  the  work  in  this  chapter  is  directed  towards 
analyzing  the  cycle,  as  an  approach  to  generate  the  vertex 
invariants,  which  are  applicable  to  the  vertex  partitioning 
for  a cyclically  involved  graph. 

According  to  the  definition,  a cycle  is  a sequence  of 
vertices  and  edges  leading  from  vertex  i to  vertex  i,  such 
that  all  its  edges  are  distinct  and  the  only  vertex  to  occur 
twice  in  the  cycle  is  vertex  i,  which  occurs  exactly  twice. 
In  the  geometrical  plane,  a special  cycle  configuration 
called  simple  cycle  is  defined  as  an  area  entirely 
surrounded  by  edges  of  the  planar  graph.  A simple  cycle 
must  be  a cycle,  but  a cycle  may  not  be  a simple  cycle. 

In  this  chapter  two  groups  of  vertex  invariants  are 
proposed.  The  first  group  of  vertex  invariants  called  the 
simple  cycle  invariants  are  computed  for  each  vertex  by 
analyzing  the  simple  cycles  to  which  it  belongs.  The  simple 
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cycle  invariants  are  especial  useful  for  the  planar  graphs 
under  pictorial  rotation,  reflection,  and  shear.  Since  any 
two  planar  graphs  under  pictorial  rotation,  reflection,  and 
shear  must  have  the  same  simple  cycle  structure,  the  simple 
cycle  invariants  can  be  used  as  vertex  invariants  in  the 
vertex  partitioning.  The  second  group  of  vertex  invariants 
the  cycle  invariants  are  computed  for  each  vertex  by 
analyzing  the  cycles  to  which  it  belongs.  The  cycle 
invariants  provide  important  information  for  the  graphs 
under  isomorphism.  Since  any  two  graphs  under  isomorphism 
must  have  the  same  cycle  structure  (Deo  [1974]),  the  cycle 
invariants  can  be  used  as  vertex  invariants  in  the  vertex 
partitioning  for  the  graphs  under  isomorphism. 


3 . 1 Generation  of  Simple  Cycle  Invariants 

A planar  graph  divides  the  plane  into  a number  of 
regions  which  we  shall  call  simple  cycles.  A simple  cycle 
is  an  area  entirely  surrounded  by  edges  of  the  planar  graph, 
which  contains  no  edge.  Vertices  and  the  associated  edges 
form  the  simple  cycles  of  the  planar  graph.  A planar  graph 
can  be  meaningfully  described  as  a conglomerate  of  vertices, 
edges,  and  simple  cycles  with  properties  attached  to  them. 

In  this  section,  we  present  an  algorithm  for  extracting 
simple  cycles  from  a graph.  A simple  cycle  matrix  is 
obtained  from  the  extracted  simple  cycles  of  the  graph.  The 
simple  cycle  invariants  for  each  vertex  are  then  computed 
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based  on  the  simple  cycle  matrix.  The  simple  cycle 
invariants  provide  important  information  for  the  planar 
graphs  under  pictorial  rotation,  reflection,  and  shear  and 
therefore  can  be  used  as  vertex  invariants  in  the  vertex 
partitioning  to  partition  the  vertex  sets  into  smaller 
subsets. 

3.1.1  Simple  Cycle  Extraction  Algorithm 

Since  a simple  cycle  is  a simple  and  closed  path  without 
enclosing  any  edge,  we  can  use  this  fact  to  derive  an 
algorithm  for  finding  simple  cycles.  Our  proposed  simple 
cycle  extraction  algorithm  (Tou  and  Fan  [1988])  makes  use  of 
the  tree  tracing  technique  to  generate  a tree  and  the 
constraint  verification  technique  to  identify  a simple 
cycle.  If  we  treat  the  starting  vertex  of  a simple  cycle  as 
the  root  and  the  terminating  vertex  as  the  leaf,  the  simple 
cycle  can  be  represented  by  a path  from  the  root  to  the  leaf 
in  the  simple  cycle  tree.  If  the  graph  contains  f simple 
cycles,  then  f simple  cycle  trees  may  be  generated. 

The  algorithm  to  extract  the  simple  cycles  from  a graph 
is  given  below: 

Algorithm  3-1 

Step  1:  List  all  vertices  of  the  graph  according  to  their 
order  of  appearance  in  the  scanning  process.  Choose  the 
first  vertex  as  the  root  vertex  for  the  first  simple  cycle 
to  be  identified. 
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Step  2;  Generate  a path  by  selecting  the  fresh  neighboring 
vertex  which  is  nearest  to  the  root  vertex.  All  vertices 
are  initially  fresh.  Trace  the  generated  path  of  the  tree 
to  check  whether  a vertex  has  been  traced  twice.  If  so,  the 
path  does  not  represent  a simple  cycle.  If  not,  check 
whether  the  number  of  vertices  equals  the  number  of  edges  in 
the  traced  part  of  the  graph.  If  so,  a simple  cycle  is 
identified.  Record  the  vertices  of  the  simple  cycle  to  form 
a vertex  list  for  the  extracted  simple  cycle. 

If  a simple  cycle  is  not  identified,  backtrack  to  the 
vertices  on  the  previous  level  and  repeat  step  2 . 

Step  3:  Discard  the  singly-connected  vertices  in  the 
previously  extracted  simple  cycle.  Choose  the  first  doubly- 
connected  vertex  as  the  root  vertex  for  the  next  simple 
cycle  to  be  identified  and  go  to  step  2.  A vertex  of  degree 
2 is  said  to  be  singly-connected.  A vertex  which  is  not 
singly-connected  is  doubly-connected. 

Step  4:  Continue  the  above  process  until  all  simple  cycles 
are  identified. 

For  each  simple  cycles  after  the  first,  we  wish  to  find 
only  those  simple  cycles  that  haven't  been  extracted  before. 
If  the  simple  cycle  has  been  extracted  before,  we  have  to 
discard  it  to  avoid  the  extraction  of  redundant  simple 
cycles.  To  avoid  the  formation  of  redundant  simple  cycles, 
we  use  a validity  marker  to  mark  each  extracted  simple  cycle 
which  will  be  traversed  by  later  simple  cycle  trees  as 
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forbidden.  Each  time  a new  tree  is  formed,  the  verification 
procedure  must  test  each  traced  path  to  prevent  traversing  a 
forbidden  simple  cycle  which  has  been  extracted. 

3. 1.1.1  Geometric  constraints  for  the  simple  cycle 

The  simple  cycle  extraction  algorithm  discussed  above 
makes  use  of  several  geometric  constraints  to  limit  the 
domain  of  feasible  solutions.  The  geometric  constraints 
come  from  the  geometric  properties  of  the  simple  cycle.  All 
geometric  constraints  are  analyzed  and  formulated  as 
necessary  conditions  for  extracting  the  simple  cycles 
embedded  in  a graph. 

To  identify  a simple  cycle  in  a graph,  we  will  make  use 
of  the  following  constraints  characterizing  a simple  cycle: 
Constraint  I:  A simple  cycle  in  a graph  G must  be  a simple 
and  closed  path,  which  means  the  edges  and  vertices  of  a 
simple  cycle  cannot  be  traced  more  than  once,  with  the 
exception  of  the  starting  vertex.  In  other  words,  a simple 
cycle  is  a path  with  the  additional  property  that  all  of  its 
vertices  and  all  of  its  edges  are  distinct,  with  the 
exception  of  the  first  and  last  vertices,  which  must 
coincide. 

Consider  the  graph  of  Figure  3.1.  The  path  FBACDB  is 
not  identified  as  a simple  cycle  because  vertex  B is  traced 


twice. 
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A 


E 


H 


Figure  3.1  Graph  3.1. 


Table  3.1  Adjacency  matrix  for  Graph  3.1. 


ABCDEFGH 


A 

B 

C 

D 

E 

F 

G 

H 


0 

1 

1 

0 

0 

0 

0 

0 


1 

0 

1 

1 

0 

1 

0 

0 


1 

1 

0 

1 

0 

0 

0 

0 


0 

1 

1 

0 

1 

0 

0 

0 


0 

0 

0 

1 

0 

0 

0 

1 


0 0 
1 0 
0 0 
0 0 
0 0 
0 1 
1 0 
1 1 


0 

0 

0 

0 

1 

1 

1 

0 
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Constraint  II:  A simple  cycle  is  entirely  surrounded  by 
edges  of  the  graph,  which  contains  no  edge.  Thus,  a simple 
cycle  with  m vertices  can  only  contain  m edges. 

Referring  to  the  graph  in  Figure  3.1,  the  path  FBCDEHF 
is  not  a simple  cycle,  since  it  violates  the  constraint  (it 
contains  seven  edges  and  six  vertices) . 

Each  constraint  forms  a necessary  condition  for  the 
verification  of  a simple  cycle.  In  order  to  be  a simple 
cycle,  the  traced  path  of  the  graph  has  to  satisfy  both 
constraints.  Without  such  constraints  the  first  path  to  be 
explored  could  go  on  forever,  eventually  cycling  back  to 
revisit  previous  states  in  an  infinite  loop.  The  constraint 
verification  process  solves  the  tree  tracing  problem  by 
means  of  a depth-first  search,  finding  locations  for 
successive  paths  and  backtracking  when  it  cannot  proceed. 
The  search  utilizes  the  constraints  so  that  early  rejection 
can  be  realized  by  attacking  the  most  restrictive  constraint 
first. 

The  constraint  verification  process  consists  of 
investigating  the  traced  path  (a^,a2,  • • • ,a]^)  such  that 


IF  Cg(a3^,a2,  • • • ,aj^)  = 0 

THEN  Cg(ai,a2, • • • ,ak,ak+l)  =0  s = 1,  2 (3-1) 

In  other  words,  if  path  (ai f ^2 ' * * * > ^k)  violates  constraint 
Cg,  then  no  extended  path  (ai,a2,  • • • ,ajr,a)^+2)  possibly 
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satisfy  Cg ; hence  by  induction,  no  extended  path 
(^1/^2' * * * satisfy  constraint  Cg. 

3. 1.1. 2 Tree  tracing 

If  we  treat  the  starting  vertex  of  a simple  cycle  as  the 
root  and  the  terminating  vertex  as  the  leaf,  the  simple 
cycle  can  be  represented  as  a path  from  the  root  to  the  leaf 
in  the  simple  cycle  tree.  Thus,  the  simple  cycle  extraction 
problem  can  be  viewed  as  a tree  tracing  problem.  The  root 
of  the  tree  (the  0^^  level)  is  the  null  vector.  Its  sons 
are  the  choices  for  a^,  and  in  general  the  nodes  at  the 
level  are  the  choices  for  aj^,  given  the  choices  made  for  a^, 
^2'***f  ^k-1  indicated  by  the  ancestors  of  these  nodes. 

The  root  of  the  simple  cycle  tree  is  the  vertex  which  is 
encountered  first  in  the  scanning  process.  is  the  set  of 

neighboring  vertices  of  the  root  and  S]^.  is  the  set  of 
vertices  which  are  the  neighbors  of  excluding  the 

parent  of 

Sk  = {N(a]^_i))  - {ak_2)  (3-2) 

where  N(a]^_2)  is  the  neighbors  of  the  vertex  at  the  (k-1)^^ 
level  of  the  simple  cycle  tree,  and  aj^  is  the  vertex  with 
the  shortest  distance  to  aj^_2  among  Sj..  We  use  the  distance 
between  two  consecutive  vertices  as  a criterion  to  determine 
which  branch  to  take  during  tree  tracing.  To  illustrate 
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the  derivation  of  Sj^,  let  us  consider  the  graph  in  Figure 
3.1.  Assume  vertex  A is  chosen  as  the  root  vertex,  i.e.,  ag 
= v^.  There  are  two  branches  from  vertex  A.  They  are  AB 
and  AC.  Therefore,  = {Vg,Vj^}.  Since  branch  AB  is 

shorter  than  branch  AC,  vertex  B is  chosen  as  a^,  which  is 
connected  to  vertices  A,  C,  D,  and  F.  According  to  Eguation 
3-2,  S2  = {va,Vc,Vd,Vf}  - {ao)  = {v^,  v^,  Vp,  Vp)  - {V;^}  = 

Vj3,  vp} . 

The  idea  of  the  search  process  in  the  simple  cycle 
algorithm  is  to  start  at  the  root  and  continually 
to  move  downward  when  possible,  taking  the  branch  with  the 
shortest  distance  whenever  a decision  is  necessary;  but  if 
it  is  impossible  to  continue  downward,  backtrack  by 
considering  the  next  alternative  on  the  previous  level.  The 
problem  we  wish  to  solve  can  be  expressed  abstractly  as  the 
task  of  finding  a path  (^i/ ^2 ' * ' * » ^m)  simple  cycle 

tree  which  satisfies  both  constraints.  This  path  consists 
of  a vector  of  undetermined  length  and  satisfies  both 
constraints  which  makes  it  a solution.  Each  aj^  is  a member 
of  a finite,  linearly  ordered  set  A^.  Thus  the  exhaustive 
search  must  consider  the  elements  of  ApxA2X*  • *xAj^  for  i = 0, 
1/  2,  •••  as  potential  solutions.  Initially  we  start  with 

the  null  vector  <p  as  our  partial  solution,  and  the  adjacency 
tells  us  which  of  the  members  of  A]^  are  candidates 
for  a^;  we  call  this  subset  Sp.  We  choose  the  element  of 
with  the  shortest  distance  as  and  form  the  partial 
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solution  (3]^).  In  general,  the  adjacency  matrix  and  the 
Partial  solution  tell  us  which  subset  Sj^  of  comprises 
candidates  for  the  extension  of  the  partial  solution 
(ai,a2,  • • • ,aj^_l)  to  , a2  , • • • , a)^_]^ , aj^)  . If  the  partial 
solution  (ai,a2,  • • • ,aj^_]^)  admits  no  possibilities  for  aj^, 
then  S]^  = 0,  so  we  backtrack  and  make  a new  choice  for 
If  there  are  no  new  choices  for  aj..^  we  backtrack  still 
further  and  make  a new  choice  for  aj^_2,  and  so  on.  This 
tree  tracing  process  is  illustrated  in  Figure  3.2. 

Below  is  a step  by  step  description  of  the  tree  tracing 
process  in  the  simple  cycle  extraction  algorithm: 

Step  1:  [Initialize.]  Set  k to  0. 

Step  2:  [Compute  the  successors.]  Set  Sj.  to  the  set  of 
Step  3:  [Advance.]  Choose  the  element  with  the  shortest 
distance  in  Sj^,  call  it  ajr+2./  delete  it  from  Sj^. 

Increase  k by  1. 

Step  4:  [Violate  the  constraints?]  If  (a^ , a2  , . . . , aj^) 
violates  the  constraints,  go  to  step  6. 

[Solution  Found?]  If  (a^^ , a2  , . . . , aj^)  is  a simple 
cycle,  output  the  solution  (aj^,a2 , . . . ,aj^)  and  terminate. 
Otherwise  go  to  step  2. 

Step  6:  [Have  all  successors  been  tried?]  If  is  not 
empty,  go  to  step  3. 

Step  7:  [Backtrack.]  Decrease  k by  1 and  go  to  step  6. 

When  a goal  node  is  reached,  the  backward  pointers  can 
be  traversed  to  find  the  simple  cycle.  It  collects  the  path 
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Si  = Ai 

k = 1 

DO  WHILE  k > 0 

DO  WHILE  S]^  = O 

“ element  with  the  shortest  distance  in 
^k  “ ^k“{^k) 

IF  (ai,a2,  . . .aj^)  does  not  violate  constraints  Cg 
THEN  IF  (ai,a2,  . . . ,aj^)  is  a solution 
THEN  record  it  and  stop 
ELSE  k = k+1 

compute  S]^ 

END  IF 

k = k-1 

STOP 


Figure  3.2  Tree  Tracing  Process  in  Simple  Cycle  Extraction 
Algorithm. 
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from  the  goal  node  to  the  root  of  the  tree  and  then  forms  a 
simple  cycle  consisting  of  the  nodes  of  the  path.  Since  we 
know  that  the  length  of  the  path  from  any  node  to  the  root 
along  the  branches  of  a simple  cycle  tree  is  minimal,  we 
need  to  save  the  path  generated  by  the  earliest  simple 
cycle  encountered. 

3. 1.1. 3 Backtracking 

The  goal  of  the  search  is  to  find  a complete  path,  and 
the  search  space  consists  of  nodes  representing  partially 
completed  paths  and  branches  representing  each  alternative 
continuation  from  the  partial  paths.  The  search  usually 
requires  an  exhaustive  search  of  the  set  of  all  potential 
paths.  It  forces  the  system  to  make  choices  between 
multiple  alternatives  as  it  proceeds  through  the  tree. 
Alternatives  may  be  dealt  with  all  at  the  same  time,  through 
parallel  processing,  or  one  at  a time,  using  a form  of 
backtracking;  back  up  to  a previous  choice-point  in  the 
tree  and  try  again.  At  each  stage  of  the  search,  if  an 
extension  of  the  current  partial  path  is  not  possible,  it 
backtracks  to  another  path  and  tries  again.  The  search 
process  requires  a significant  amount  of  bookkeeping  to  keep 
track  of  the  multiple  possibilities:  all  the  ones  not  yet 
tried,  in  the  case  of  backtracking.  Here,  we  use  stacks  to 
keep  track  of  elements  during  tree  tracing  and 
backtracking.  The  implementation  of  a stack  is  achieved  by 
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using  a list  structure.  For  example,  the  list  structure 
implementation  for  the  path  shown  in  Figure  3.3(a)  is  shown 
in  Figure  3.3(b).  Shown  in  Figure  3.3(b)  is  the  list 
structure  of  the  path  (ABCHGB) . Vertex  A is  the  starting 
vertex  in  the  list  structure.  There  are  two  branches  from 
vertex  A.  They  are  AB  and  AF.  Since  branch  AB  is  shorter 
than  branch  AF , the  list  will  continue  from  vertex  B which 
is  connected  to  vertices  C and  E.  Since  branch  BC  is 
shorter  than  branch  BE,  the  list  will  follow  branch  BC  to 
vertex  C,  which  is  connected  to  vertices  D and  H.  Since 
branch  CH  is  shorter  than  branch  CD,  the  list  will  follow 
branch  CH  to  vertex  H,  which  is  connected  to  vertex  G. 
Since  branch  HG  is  the  only  branch  starting  from  vertex  H, 
the  list  will  follow  branch  HG  to  vertex  G,  which  is 
connected  to  vertex  B.  A path  (ABCHGB)  is  generated.  Since 
vertex  B is  traced  twice,  it  violates  the  constraint.  Now, 
the  tree  searching  process  backtracks  to  vertex  C and 
continues  the  process  from  vertex  D until  a valid  simple 
cycle  is  identified. 

Backtracking  involves  retracing  from  the  choice-point 
that  failed  when  a constraint  violation  is  found.  The  basic 
idea  of  backtracking  is  to  build  up  the  path  one  vertex  at  a 
time  and  to  test  whether  the  path  being  formed  still  has  a 
chance  of  success.  The  path  can  be  either  accepted  or 
rejected  on  the  basis  of  constraint  verification. 
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Figure  3 . 


(a) 


3 a)  A Path  in  a Tree,  b)  The  List  Structure 
Implementation  of  the  Path. 
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(b) 


Figure  3.3  (Continued). 
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3. 1.1. 4 Illustration  example 

To  illustrate  the  identification  of  simple  cycles,  let 
us  consider  the  graph  as  shown  in  Figure  3.1.  Figure  3.1 
represents  a graph  which  contains  four  simple  cycles.  The 
first  simple  cycle  tree  of  this  graph  is  generated  in  Figure 
3.4(a).  Figure  3.4(a)  indicates  how  Algorithm  3-1  might 
traverse  this  simple  cycle  tree  if  vertex  A were  selected  as 
the  root.  Vertex  A is  chosen  as  the  root  vertex  since  it  is 
scanned  first.  There  are  two  branches  from  vertex  A.  They 
are  AB  and  AC.  Since  branch  AB  is  shorter  than  branch  AC, 
the  simple  cycle  tree  will  continue  from  vertex  B which  is 
connected  to  vertices  C,  D and  F.  Since  branch  BD  is 
shorter  than  branches  BC  and  BF,  the  simple  cycle  tree  will 
follow  branch  BD  to  vertex  D,  which  is  connected  to  vertices 
C and  E.  The  simple  cycle  tree  will  follow  branch  DC  to 
vertex  C,  which  is  connected  to  vertices  A and  B.  Since 
branch  CB  is  shorter  than  branch  CA,  the  simple  cycle  tree 
will  follow  branch  CB  to  vertex  B.  The  path  ABDCB  is  not  a 
simple  cycle,  since  vertex  B has  been  traced  twice.  The 
tracing  procedure  backtracks  to  vertex  C which  is  the 
immediate  branching  point.  Following  the  above  procedure  a 
path  ABDCA  is  generated.  This  path  does  not  represent  a 
simple  cycle  since  the  number  of  vertices  is  not  equal  to 
the  number  of  edges.  The  tracing  procedure  backtracks  to 
vertex  D,  the  immediate  branching  point,  and  generates  paths 
ABDEHGFH,  ABDEHGFB,  ABDEHFB,  and  ABDEHFGH.  These  paths  do 
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not  represent  simple  cycles  since  vertex  B or  vertex  H has 
been  traced  twice.  The  procedure  now  backtracks  to  vertex  B 
to  generate  a path  ABCA,  which  represents  a simple  cycle. 
The  paths  with  the  'X'  marks  represent  the  paths  violating 
the  constraints. 

Since  vertex  A is  singly-connected,  it  is  discarded  in 
the  second  simple  cycle  tree  generation.  Since  vertex  B is 
the  first  doubly-connected  vertex  in  the  vertex  list  of  the 
first  simple  cycle,  it  is  chosen  as  the  root  vertex  for  the 
second  simple  cycle  tree,  which  is  generated  in  Figure 
3.4(b).  Figure  3.4(b)  represents  the  second  tree  grown  by 
the  algorithm.  The  path  BDCB  represents  a simple  cycle. 
Since  vertex  C is  singly-connected,  it  is  discarded  in  the 
third  simple  cycle  tree  generation.  Since  vertex  D is  the 
first  doubly-connected  vertex  in  the  vertex  list  of  the 
second  simple  cycle,  it  is  chosen  as  the  root  vertex  for  the 
third  simple  cycle  tree,  which  is  shown  in  Figure  3.4(c). 
The  path  DBFHED  represents  a simple  cycle.  Since  vertices 
B,  D,  and  E are  singly-connected,  they  are  discarded  in  the 
fourth  simple  cycle  tree  generation.  Since  vertex  F is  the 
first  doubly-connected  vertex  in  the  vertex  list  of  the 
third  simple  cycle,  it  is  chosen  as  the  root  vertex  for  the 
fourth  simple  cycle  tree,  which  is  shown  in  Figure  3.4(d). 
The  path  FGHF  is  a simple  cycle.  Since  this  is  the  last 
simple  cycle  to  be  identified,  the  procedure  is  terminated. 
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(a) 


Figure  3.4 


Illustration  of  Simple  Cycle  Extraction  Process. 
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(b) 


Figure  3.4  (Continue). 
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(d) 


Figure  3.4  (Continue). 
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The  four  simple  cycles  extracted  are  (ABC) , (BDC) , (DBFHE) , 
(FGH) . 

3.1.2  Simple  Cycle  Matrix 

A matrix  called  simple  cycle  matrix  is  developed  to 
store  the  extracted  simple  cycles  of  a graph.  In  a 
connected  graph  with  indexed  vertices,  the  simple  cycles  fj^ 
of  the  graph  have  the  simple  cycle  matrix  F,  each  row  of 
which  characterizes  one  of  the  simple  cycles.  A simple 
cycle  matrix  F is  obtained  by  taking  for  its  row  i,  the 
simple  cycle  i,  and  for  its  columns  the  vertices  from  the 
first  to  the  last  of  the  simple  cycle.  Hence  the  entries  of 
the  matrix  are  1 or  0 depending  on  whether  a vertex  belongs 
to  a given  simple  cycle  or  not.  To  illustrate  the  formation 
of  the  simple  cycle  matrix,  let  us  consider  the  graph  in 
Figure  3.1,  which  has  the  four  simple  cycles;  (ABC),  (BDC), 
(DBFHE),  (FGH).  Let  us  number  them  f^^,  f 2 , f 3 , and  f 4 , 
respectively.  Then  we  can  get  the  4x8  simple  cycle  matrix 
as  shown  in  Table  3.1. 

Table  3.1  Simple  Cycle  Matrix  for  Graph  3.1. 


A 

B 

c 

D 

E 

F 

G 

H 

fi: 

(ABC) 

1 

1 

1 

0 

0 

0 

0 

0 

(BDC) 

0 

1 

1 

1 

0 

0 

0 

0 

^3- 

(DBFHE) 

0 

1 

0 

1 

1 

1 

0 

1 

f4: 

(FGH) 

0 

0 

0 

0 

0 

1 

1 

1 
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where  the  row  of  the  matrix  corresponds  to  the 

extracted  simple  cycle  of  the  graph.  For  instance,  the 
first  extracted  simple  cycle  is  (ABC) , the  elements  in  the 
first  row  of  the  matrix  corresponding  to  vertices  A,  B,  C 
are  all  set  to  1. 

3.1.3  Surrounding  Simple  Cycle  vector 

The  surrounding  simple  cycle  vector  of  a vertex  is  the 
concatenation  of  all  simple  cycles  surrounding  the  vertex. 
The  simple  cycles  surrounding  a vertex,  Vj , can  be  obtained 
from  the  simple  cycle  matrix,  which  are  the  simple  cycles 
with  value  '1'  in  column  j of  the  simple  cycle  matrix.  For 
instance,  consider  the  graph  in  Figure  3.1.  There  are  three 
simple  cycles  surrounding  vertex  B;  they  are  (ABC) , (BDC) , 
and  (DBFHE) . The  surrounding  simple  cycle  vector  of  vertex 
B is  then  obtained  by  concatenating  the  simple  cycles 
surrounding  vertex  B,  which  is  [ (ABC) , (BDC) , (DBFHE) ] . 

3.1.4  Simple  Cycle  Invariants 

The  simple  cycle  invariants  are  properties  of  a vertex 
which  are  invariant  under  pictorial  rotation,  reflection, 
and  shear  of  a planar  graph.  Four  simple  cycle  invariants 
are  proposed.  They  are  (1)  the  surrounding  simple  cycle 
number,  (2)  the  simple  cycle  length  vector,  (3)  the  vertex 
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label  simple  cycle  vector,  and  (4)  the  edge  label  simple 
cycle  vector. 

3. 1.4.1  Surrounding  simple  cycle  number 

The  surrounding  simple  cycle  number  of  a vertex  is  the 
number  of  simple  cycles  surrounding  the  vertex.  The 
surrounding  simple  cycle  number  can  be  obtained  by  counting 
the  number  of  I's  in  column  j of  the  simple  cycle  matrix. 
For  instance,  referring  to  the  graph  in  Figure  3.1,  the 
surrounding  simple  cycle  number  of  vertex  B is  3,  which  is 
obtained  by  counting  the  number  of  '1'  in  column  2 of  the 
simple  cycle  matrix  (column  2 corresponds  to  vertex  B) . 

The  surrounding  simple  cycle  number  can  be  used  as  a 
vertex  invariant  in  the  vertex  partitioning  for  the  graphs 
under  pictorial  rotation,  reflection,  and  shear.  Consider 
two  vertices  v^.  and  Vg  belonging  to  graphs  and 

respectively.  Graph  G^  is  a rotation,  translation,  or 
scale  changes  of  graph  G^.  An  isomorphism  of  G^  onto  G^  can 
map  Vj.  onto  Vg  only  if  the  number  of  simple  cycles 
surrounding  Vj-  equals  the  number  of  simple  cycles 
surrounding  Vg.  To  illustrate  this,  let  us  consider  the 
problem  of  partitioning  the  vertices  of  graphs  G^  and  G^  in 
Figure  3.5.  Graph  G^  is  a rotation  of  graph  G^  by  180 
degree.  Algorithm  3-1  produces  the  set  of  simple  cycles  for 
graph  G^  and  stores  them  in  the  simple  cycle  matrix  as  shown 
in  Table  3.1.  A summary  of  the  surrounding  simple  cycle 
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a l2  b 


G 


1 


G 


2 


Vertex 

Vertex  Label 

A 

Ll 

B 

L3 

C 

L2 

D 

L2 

E 

Ll 

F 

L2 

G 

Ll 

H 

L2 

Vertex 

Vertex  Label 

a 

L2 

b 

Ll 

c 

L2 

d 

Ll 

e 

L2 

f 

L2 

g 

L3 

h 

Ll 

Figure  3.5  A Pair  of  Graphs  Illustrating  the  Usefulness 
of  Simple  Cycle  Invariants  in  the  Vertex 
Partitioning  Process. 
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numbers  for  the  vertices  in  graph  could  be  written  and 
is  shown  in  Table  3.2,  in  which  each  vertex  is  associated 
with  its  corresponding  surrounding  simple  cycle  number. 


Table  3.1  Simple  Cycle  Matrix  for  Graph  G^  in  Figure  3.5. 


A 

B 

c 

D 

E 

F 

G 

H 

(ABC) 

1 

1 

1 

0 

0 

0 

0 

0 

^2’ 

(BDC) 

0 

1 

1 

1 

0 

0 

0 

0 

(DBFHE) 

0 

1 

0 

1 

1 

1 

0 

1 

£4: 

(FGH) 

0 

0 

0 

0 

0 

1 

1 

1 

Table  3.2  A Summary  of  Surrounding  Simple  Cycle  Number  for 
Graph  G^  in  Figure  3.5. 


Vertex 

Surrounding 
Simple  Cycle  Number 

A 

1 

B 

3 

C 

2 

D 

2 

E 

1 

F 

2 

G 

1 

H 

2 

If  the  same  analysis  is  done  for  graph  G^,  the  four 
simple  cycles  extracted  are  (ade) , (dchba) , (hgb) , and 
(gfb) . The  simple  cycle  matrix  and  a summary  of  surrounding 
simple  cycle  number  for  graph  G^  are  shown  in  Table  3.3  and 
Table  3.4  respectively. 
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Table  3.3  Simple  Cycle  Matrix  for  Graph  in  Figure  3.5. 


a 

b 

c 

d 

e 

f 

g 

h 

(abc) 

1 

1 

1 

0 

0 

0 

0 

0 

2 = 

(cadeg) 

1 

0 

1 

1 

1 

0 

1 

0 

3: 

(egf) 

0 

0 

0 

0 

1 

1 

1 

0 

4- 

(ghf) 

0 

0 

0 

0 

0 

1 

1 

1 

Table  3.4  A Summary  of  Surrounding  Simple  Cycle  Number  for 
Graph  g2  in  Figure  3.5. 


Vertex 

Surrounding 
Simple  Cycle  Number 

a 

2 

b 

1 

c 

2 

d 

1 

e 

2 

f 

2 

g 

3 

h 

1 

Partitioning  the  vertex  sets  of  graphs  G^  and  G^  in 
Figure  3.5  based  on  the  surrounding  simple  cycle  number  of 
each  vertex,  the  generated  partitioning  result  is: 


(A,E,G) , (C,D,F,H) ,B 
(b,d,h) , (a,c,e, f) ,g 


3. 1.4. 2.  Simple  cycle  length  vector 

The  length  of  a simple  cycle  is  defined  as  the  number  of 
edges  contained  in  the  simple  cycle.  This  is  of  course. 
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equal  to  the  number  of  vertices  contained  in  this  simple 
cycle.  For  each  vertex  in  a graph,  one  first  enumerates 
simple  cycles  of  different  length  surrounding  the  vertex  and 
constructs  a sequence  of  simple  cycle  length  numbers  for  the 
vertex.  The  simple  cycle  length  vector  of  vertex  k in  graph 
G is  defined  as  vf(k)  = (n3 , n4 , ns , • • • , n j ) , where  nj,  equals 
the  number  of  simple  cycles  of  length  i in  which  vertex  k is 
contained  and  j is  the  longest  simple  cycle  length  of  all 
extracted  simple  cycles  of  graph  G.  Enumeration  of  the 
simple  cycle  length  vector  of  a vertex  is  straightforward: 
one  simply  considers  each  vertex  separately  and  forms  a 
sequence  of  numbers,  the  first  element  giving  the  number  of 
length  3 simple  cycles  surrounding  the  vertex  (since  the 
minimum  length  of  a simple  cycle  is  three) , the  second 
element  giving  the  number  of  length  4 simple  cycles 
surrounding  the  vertex,  and  so  on.  For  instance,  consider 
vertex  B in  graph  G^  of  Figure  3.5.  There  are  three  simple 
cycles  surrounding  vertex  B.  They  are  (ABC) , (BDC) , and 
(DBFHE) , with  length  3,  length  3,  and  length  5, 
respectively.  Since  the  longest  simple  cycle  length  of  all 
extracted  simple  cycles  of  the  graph  is  5,  the  simple  cycle 
length  vector  of  vertex  B is  (n3,n4,n5),  the  first  element 
giving  the  number  of  length  3 simple  cycles  surrounding 
vertex  B,  the  second  element  giving  the  number  of  length  4 
simple  cycles  surrounding  vertex  B,  and  the  third  element 
giving  the  number  of  length  five  simple  cycles  surrounding 
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vertex  B.  There  are  2 simple  cycles  with  length  3,  1 simple 
cycle  with  length  5 surrounding  vertex  B.  Therefore,  the 
simple  cycle  length  vector  of  vertex  B is  (2,0,1). 

The  simple  cycle  length  vector  can  be  used  as  a vertex 
invariant  in  the  vertex  partitioning  for  the  graphs  under 
rotation,  translation,  and  scale  changes.  Consider  two 

vertices  Vj-  and  Vg  belonging  to  graphs  and 

respectively.  Graph  G^  is  a rotation,  translation,  or  scale 
changes  of  graph  G^.  An  isomorphism  of  G^  onto  G^  can  map 
Vr  onto  Vg  only  if  the  simple  cycle  length  vector  of  v^ 
eguals  the  simple  cycle  length  vector  of  Vg.  To  illustrate 
this,  let  us  consider  the  problem  of  partitioning  the 
vertices  of  graphs  G^  and  in  Figure  3.5.  Graph  G^  is  a 
rotation  of  graph  G^  by  180  degree.  A summary  of  the  simple 
cycle  length  vector  for  the  vertices  in  graph  G^  could  be 
''^i^itten  and  shown  in  Table  3.5,  in  which  each  vertex  is 
associated  with  its  corresponding  simple  cycle  length 
vector. 

Table  3.5  A Summary  of  Simple  Cycle  Length  Vector  for  Graph 
in  Figure  3.5. 


Vertex 

Simple  Cycle  Length  Vector 

A 

(1,0,0) 

B 

(2,0,1) 

C 

(2,0,0) 

D 

(1,0,1) 

E 

(0,0,1) 

F 

(1,0,1) 

G 

(1,0,0) 

H 

(1,0,1) 
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If  the  same  analysis  is  done  for  graph  G^,  a summary  of 
simple  cycle  length  vectors  for  the  vertices  in  graph 
could  be  written  and  shown  in  Table  3.6. 

Table  3.6  A Summary  of  Simple  Cycle  Length  Vector  for  Graph 
in  Figure  3.5. 


Vertex 

simple  Cycle  Length  Vector 

a 

(1,0,1) 

b 

(1,0,0) 

c 

(1,0,1) 

d 

(0,0,1) 

e 

(1,0,1) 

f 

(2,0,0) 

g 

(2,0,1) 

h 

(1,0,0) 

Partitioning  the  vertex  sets  of  G^  and  G^  in  Figure  3.5 
by  considering  each  simple  cycle  length  vector  as  a single 
decimal  number,  the  generated  partitioning  result  is: 

E, (A,G) , (D,F,H) ,C,B 
d, (b,h) , (a,c,e) , f,g 

3 » 1.4. 3.  Vertex  label  simple  cycle  vector 

The  vertex  label  simple  cycle  vector  of  a subject  vertex 
is  obtained  by  substituting  the  vertex  label  of  each  vertex 
to  the  surrounding  simple  cycle  vector  of  the  subject 
vertex,  then  enumerating  the  vertex  label  distribution  for 
each  simple  cycle  in  the  surrounding  simple  cycle  vector  and 
reordering  the  generated  numeric  sequence  in  ascending  order 
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if  we  treat  each  vertex  label  distribution  as  a single 
decimal  number.  In  the  vertex  label  simple  cycle  vector  of 
vertex  x,  the  first  element  giving  the  vertex  label 
distribution  of  the  first  simple  cycle  surrounding  vertex  x, 
the  second  element  giving  the  vertex  label  distribution  of 
the  second  simple  cycle  surrounding  vertex  x,  and  so  on. 
The  length  of  each  element  in  the  vertex  label  simple  cycle 
vector  is  equal  to  the  number  of  vertex  labels  in  the  graph. 
For  instance,  consider  vertex  B in  graph  of  Figure  3.5. 
The  surrounding  simple  cycle  vector  of  vertex  B is  [ (ABC) , 
(BDC) , (DBFHE) ] . Substituting  the  vertex  label  of  each 
vertex  into  the  surrounding  simple  cycle  vector  of  vertex  B, 
the  result  generated  is  [ (L1L3L2)  , (L3L2L2)  , (L2L3L2L2L3^)  ] . 
Since  the  number  of  vertex  labels  in  the  graph  is  3,  the 
length  of  each  element  in  the  vertex  label  simple  cycle 
vector  is  3.  By  enumerating  the  vertex  label  distribution 
of  each  simple  cycle  and  reordering  the  generated  numeric 
sequence,  the  vertex  label  simple  cycle  vector  of  vertex  B 
is  then  obtained,  which  is  [ (021) , ( 111) , (131) ] . 

The  vertex  label  simple  cycle  vector  can  be  used  as  a 
vertex  invariant  in  the  vertex  partitioning  for  the  graphs 
under  rotation,  translation,  and  scale  changes.  Consider 
two  vertices  Vj.  and  Vg  belonging  to  graphs  and  G^ 
respectively.  Graph  G^  is  a rotation,  translation,  or  scale 
changes  of  graph  G^.  An  isomorphism  of  G^  onto  G^  can  map 
Vj.  onto  Vg  only  if  the  vertex  label  simple  cycle  vector  of 
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equals  the  vertex  label  simple  cycle  vector  of  Vg.  To 
illustrate  this,  let  us  consider  the  problem  of  partitioning 
the  vertices  of  graphs  and  in  Figure  3.5.  Graph  G^  is 
a rotation  of  graph  G^  by  180  degree.  Shown  in  Table  3.7  is 
a list  of  the  surrounding  simple  cycle  vector  for  each 
vertex  in  the  graph.  A summary  of  the  vertex  label  simple 
cycle  vectors  for  the  vertices  in  graph  G^  could  be  written 
and  shown  in  Table  3.8,  in  which  each  vertex  is  associated 
with  its  corresponding  vertex  label  simple  cycle  vector. 


Table  3.7 


A Summary  of  Surrounding  Simple  Cycle  Vector  for 
Graph  G^  in  Figure  3.5. 


Vertex 

Surrounding  Simple  Cycle  Vector 

A 

[ (ABC) ] 

B 

[ (ABC) , (BDC) , (DBFHE) ] 

C 

[(ABC) , (BDC) ] 

D 

[ (BDC) , (DBFHE) ] 

E 

[ (DBFHE) ] 

F 

[ (EBFHE) , (FGH) ] 

G 

[(FGH) ] 

H 

[ (DBFHE) , (FGH) ] 

Table  3.8  A Summary  of  Vertex  Label  Simple  Cycle  Vector  for 
Graph  G^  in  Figure  3.5. 


Vertex 

Vertex  Label  Simple  Cycle  Vector 

A 

[(111) ] 

B 

[ (021) , (111) , (131) ] 

C 

[ (021) , (111) ] 

D 

[ (021) , (131) ] 

E 

[ (131) ] 

F 

[ (120) , (131) ] 

G 

[ (120) ] 

H 

[ (120) , (131) ] 
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If  the  same  analysis  is  done  for  graph  , the 
surrounding  simple  cycle  vectors  and  vertex  label  simple 
cycle  vectors  for  the  vertices  in  graph  could  be  written 
and  shown  in  Tables  3.9  and  3.10. 


Table  3.9 


A Summary  of  Surrounding  Simple  Cycle  Vector  for 
Graph  G^  in  Figure  3.5. 


Vertex 

Surrounding  Simple  Cycle  Vector 

a 

[ (abc) , (cadeg) ] 

b 

[ (abc) ] 

c 

[ (abc) , (cadeg) ] 

d 

[ (cadeg) ] 

e 

[ (cadeg) , (egf) ] 

f 

[ (egf) , (ghf) ] 

g 

[ (cadeg) , (egf) , (ghf] 

h 

[ (ghf) ] 

Table  3.10 


A Summary  of  Vertex  Label  Simple  Cycle  Vector 
for  Graph  G^  in  Figure  3.5. 


Vertex 

Vertex  Label  Simple  Cycle  Vector 

a 

[(120) , (131) ] 

b 

[ (120) ] 

c 

[ (120) , (131) ] 

d 

[ (131) ] 

e 

[ (021) , (131) ] 

f 

[(021) , (111) ] 

g 

[ (021) , (111) , (131] 

h 

[(111) ] 

Partitioning  the  vertex  sets  of  graphs  G^  and  G^  in 
Figure  3.5  by  considering  each  vertex  label  simple  cycle 
vector  as  a single  decimal  number,  the  generated 
partitioning  result  is: 
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A,G,E,C,D, (F,H) ,B 
h,b,d,f,e, (a,c) ,g 

3. 1.4. 4.  Edge  label  simple  cycle  vector 

The  edge  label  simple  cycle  vector  of  a subject  vertex 
is  obtained  by  substituting  the  edge  label  of  each  edge  to 
every  simple  cycle  in  the  surrounding  simple  cycle  vector  of 
the  vertex,  then  enumerating  the  edge  label  distribution  for 
each  simple  cycle  in  the  surrounding  simple  cycle  vector  and 
reordering  the  generated  numeric  sequence  in  ascending  order 
if  we  treat  each  edge  label  distribution  as  a single  decimal 
number.  In  the  edge  label  simple  cycle  vector  of  vertex  x, 
the  first  element  giving  the  edge  label  distribution  of  the 
first  simple  cycle  surrounding  vertex  x,  the  second  element 
giving  the  edge  label  distribution  of  the  second  simple 
cycle  surrounding  vertex  x,  and  so  on.  The  length  of  each 
element  in  the  edge  label  simple  cycle  vector  is  equal  to 
the  number  of  edge  labels  in  the  graph.  For  instance, 
consider  vertex  B in  graph  of  Figure  3.5.  The 

surrounding  simple  cycle  vector  of  vertex  B is  [ (ABC) , (BDC) , 
(DBFHE) ] . Since  the  number  of  edge  labels  in  the  graph  is 
3,  the  length  of  each  element  in  the  edge  label  simple  cycle 
vector  should  be  3.  The  first  simple  cycle  in  the 
surrounding  simple  cycle  vector  of  vertex  B is  (ABC) , the 
edge  label  for  each  pair  of  vertices  in  the  simple  cycle  are 
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as  follows;  AB  is  I3,  BC  is  1^,  and  CA  is  I3 . Thus  the  edge 
label  distribution  of  the  first  simple  cycle  is  (101) . The 
second  simple  cycle  in  the  surrounding  simple  cycle  vector 
of  vertex  B is  (BDC)  , the  edge  label  for  each  pair  of 
vertices  in  the  simple  cycle  are  as  follows;  BD  is  I2,  DC  is 
I2,  and  CB  is  I3.  Thus  the  edge  label  distribution  of  the 
second  simple  cycle  is  (120) . The  third  simple  cycle  in  the 
surrounding  simple  cycle  vector  of  vertex  B is  (DBFHE) , the 
edge  label  for  each  pair  of  vertices  in  the  simple  cycle  are 
as  follows;  DB  is  I2,  BF  is  1^,  FH  is  I2,  HE  is  I3,  and  ED 
is  I2.  Thus  the  edge  label  distribution  of  the  third  simple 
cycle  is  (131) . If  we  consider  the  edge  label  distribution 
of  each  simple  cycle  as  a single  decimal  number  and  reorder 
the  generated  numeric  sequence,  the  edge  label  simple  cycle 
vector  of  vertex  B is  then  obtained,  which  is  [ ( 101) , ( 120) , 
(131) ] . 

The  edge  label  simple  cycle  vector  can  be  used  as  a 
vertex  invariant  in  the  vertex  partitioning  for  the  graphs 
under  rotation,  translation,  and  scale  changes.  Consider 
two  vertices  Vj.  and  Vg  belonging  to  graphs  and 

respectively.  Graph  G^  is  a rotation,  translation,  or  scale 
changes  of  graph  G^.  An  isomorphism  of  G^  onto  G^  can  map 
V;^  onto  Vg  only  if  the  edge  label  simple  cycle  vector  of  Vj- 
equals  the  edge  label  simple  cycle  vector  of  Vg.  To 
illustrate  this,  let  us  consider  the  problem  of  partitioning 
the  vertices  of  graphs  G^  and  G^  in  Figure  3.5.  Graph  G^  is 
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a rotation  of  graph  by  180  degree.  A summary  of  the  edge 
label  simple  cycle  vectors  for  the  vertices  in  graph 
could  be  written  and  shown  in  Table  3.11,  in  which  each 
vertex  is  associated  with  its  corresponding  edge  label 
simple  cycle  vector. 


Table  3.11 


A Summary  of  Edge  Label  Simple  Cycle  Vector  for 
Graph  G^  in  Figure  3.5. 


Vertex 

Edge  Label  Simple  Cycle  Vector 

A 

[(102) ] 

B 

[(102), (120), (131)] 

C 

[(102) , (120)] 

D 

[ (120) , (131) ] 

E 

[ (131) ] 

F 

[ (030)  , (131) ] 

G 

[ (030) ] 

H 

[ (030) , (131)  ] 

If  the  same  analysis  is  done  for  graph  G^ , the  summary 
of  edge  label  simple  cycle  vector  for  the  vertices  in  graph 
G^  could  be  written  and  shown  in  Table  3.12. 


Table  3.12 


A Summary  of  Edge  Label  Simple  Cycle  Vector  for 
Graph  G^  in  Figure  3.5. 


Vertex 

Edge  Label  Simple  Cycle  Vector 

a 

[(030) , (131) ] 

b 

[(030) ] 

c 

[ (030) , (131) ] 

d 

[(131) ] 

e 

[ (120) , (131) ] 

f 

[(102) , (120)] 

g 

[(102) , (120) , (131] 

h 

[(102) ] 
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Partitioning  the  vertex  sets  of  graphs  and  in 

Figure  3.5  by  considering  each  edge  label  simple  cycle 
vector  as  a single  decimal  number,  the  generated 
partitioning  result  is: 

G,A,E, (F,H) ,C,D,B 
b,h,d, (a,c) ,f ,e,g 

3.1.5  Vertex  Partitioning  Using  Simple  Cycle  Invariants 

The  simple  cycle  invariants  derived  in  this  section  are 
invariant  under  pictorial  rotation,  reflection,  and  shear  of 
planar  graphs.  Therefore,  they  can  be  used  as  vertex 
invariants  to  partition  the  vertex  sets  of  a pair  of 
cyclically  involved  planar  graphs. 

Since  the  vertex  label  and  edge  label  are  employed  in 
the  generation  of  the  vertex  label  simple  cycle  vector  and 
edge  label  simple  cycle  vector,  the  information  contained  in 
the  vertex  label  simple  cycle  vector  and  edge  label  simple 
cycle  vector  is  finer  than  that  of  the  surrounding  simple 
cycle  number  and  simple  cycle  length  vector.  The  vertices 
have  the  same  surrounding  simple  cycle  number  and  simple 
cycle  length  vector  may  not  have  the  same  vertex  label 
simple  cycle  vector  or  edge  label  simple  cycle  vector. 
Therefore,  the  vertex  label  simple  cycle  vector  and  edge 
label  simple  cycle  vector  can  be  employed  to  refine  the 
vertex  cell  sets  generated  by  the  surrounding  simple  cycle 
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number  and  simple  cycle  length  vector.  For  instance, 
consider  the  vertex  cell  sets  generated  by  applying  the 
simple  cycle  length  vector  to  the  graphs  in  Figure  3.5. 

E, (A,G) , (D,F,H) ,C,B 
d, (b,h) , (a,c,e) ,f,g 

There  are  two  cells  with  more  than  one  vertex  in  the 
vertex  cell  sets,  one  with  two  vertices  and  the  other  with 
three  vertices.  If  we  apply  the  vertex  label  simple  cycle 
vector  to  the  above  vertex  cell  sets,  the  new  vertex  cell 
set  generated  is 


E,A,G,D, (F,H) ,C,B 
d,h,b,e, (a,c)  ,f,g 

Notice  that  in  the  new  vertex  cell  set  only  one  cell 
has  more  then  one  vertex  and  the  number  of  vertices  in  that 
cell  has  been  reduced  from  three  to  two. 

If  the  application  of  all  simple  cycle  invariants  still 
can  not  find  the  one-to-one  vertex  correspondence,  consider 
the  simple  cycle  invariants  of  the  neighboring  region 
surrounding  the  vertices  to  be  resolved.  The  principle  can 
be  stated  as  follows;  if  graphs  G^  and  G^  are  isomorphic  and 
vertex  x with  neighbors  {Vj^}  in  graph  G^  corresponds  to 
vertex  x*  with  neighbors  {v*j^}  in  G^ , then  the  set  of  simple 
cycle  invariants  for  {v^)  in  G^  must  be  the  same  as  the  set 
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of  simple  cycle  invariants  for  {v*j^}  in  . If  the 
neighboring  region  still  can  not  resolve  the  vertex 
partitioning,  consider  the  next  regions  further  away,  and 
then  beyond  those  until  finally  to  delineate  the  one-to-one 
vertex  correspondence.  The  method  to  resolve  the 
ambiguities  for  the  graphs  under  translation,  rotation,  or 
scale  changes  can  be  implemented  by  utilizing  the  simple 
cycle  invariants  together  with  the  distance  list  which  will 
be  discussed  in  Chapter  5. 

3.2  Generation  of  Cycle  Invariants 

A cycle  is  defined  as  an  alternative  sequence  of 
vertices  and  edges  which  are  adjacent  with  no  vertices 
appearing  more  than  once  except  the  starting  vertex  and 
terminal  vertex  which  are  the  same  vertex  and  occurs  exactly 
twice  in  the  sequence.  The  cycle  invariants  derived  based 
on  the  cycle  analysis  can  be  used  as  vertex  invariants  in 
the  vertex  partitioning  for  the  graphs  under  isomorphism. 

Several  cycle  extraction  algorithms  have  been  proposed 
for  the  past  decade.  Gibbs  [1969]  proposed  a method  of 
finding  every  cycle  of  an  undirected  linear  graph  from  a 
finite  set  of  fundamental  cycles.  Smith  [1986]  presented  an 
algorithm  for  searching  the  depth-first  spanning  trees  of 
planar  graphs,  which  has  been  used  as  a basis  for  many  other 
graph-theoretic  algorithms.  The  actual  construction  of  the 
depth-first  spanning  tree  made  crucial  use  of  the  PARTITION 
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result,  which  regarded  each  mesh  cycle  as  a cycle  of  the 
planar  graph. 

All  the  cycle  extraction  algorithms  presented  so  far, 
extract  all  cycles  of  a graph  and  use  them  as  a basis  for 
other  graph-theoretic  algorithms.  Our  purpose  here  is  to 
derive  a cycle  length  vector  which  is  generated  based  on 
the  extracted  cycles  surrounding  a given  vertex  of  a graph. 
It  is  clear  that  extracting  all  cycles  of  a graph  and  then 
deriving  the  cycle  length  vector  from  the  extracted  cycles 
is  not  suitable  for  programming  since  the  number  of  cycles 
of  a graph  is  much  larger  than  the  number  of  cycles 
surrounding  any  given  vertex  of  the  graph.  Thus,  it  is 
desirable  to  have  an  algorithm  which  will  generate  the 
cycles  surrounding  a given  vertex  directly  without  first 
finding  every  cycle  of  the  graph.  In  response  to  this  need, 
a cycle  extraction  algorithm  for  extracting  cycles 
surrounding  a vertex  is  proposed.  The  algorithm  is  more 
suitable  for  programming  because  the  set  of  cycles 
surrounding  a vertex  is  generally  much  smaller  than  the  set 
of  all  cycles  contained  in  the  graph. 

3.2.1  Cycle  Extraction  Algorithm 

The  proposed  cycle  extraction  algorithm  extracts  cycles 
surrounding  a vertex  by  expanding  the  cycle  tree  of  the 
vertex.  The  strategy  will  be  to  develop  a cycle  tree  of  the 
vertex  using  a breadth-first  search  of  the  graph. 
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We  use  the  fact  that  a cycle  is  a simple  and  closed  path 
to  derive  an  algorithm  for  finding  cycles  containing  a given 
vertex.  The  cycles  surrounding  other  vertices  of  the  graph 
can  be  found  by  repeated  use  of  the  algorithm  for  each 
vertex. 

The  algorithm  begins  by  growing  a breadth-first  cycle 
tree  from  the  vertex  of  interest.  The  nodes  of  the  tree  are 
then  visited  in  breadth-first  search  order.  When  a node  is 
visited,  the  path  is  then  examined  to  determine  if  the  path 
can  be  identified  as  a cycle. 

We  can  state  the  cycle  extraction  algorithm  as  follows: 
Algorithm  3-2 : 

Step  1:  Choose  the  subject  vertex  as  the  root  vertex  to 
expand  the  cycle  tree. 

Step  2:  Generate  a path  by  selecting  the  active  neighboring 
vertex  which  is  expanded  first  based  on  the  breadth-first 
search  order.  All  vertices  are  initially  active.  Trace  the 
generated  path  of  the  tree  to  check  whether  a vertex  has 
been  traced  twice.  If  so,  the  path  does  not  represent  a 
cycle.  Inactivate  the  expanded  vertex  and  repeat  step  2. 
Otherwise,  a cycle  is  identified.  Inactivate  the  expanded 
vertex  and  go  to  step  3 . 

Step  3:  Record  the  extracted  cycle.  Concatenate  the  cycle- 
list  of  the  extracted  cycle  to  the  cycle-list  of  previously 
extracted  cycles. 
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Step  4:  Check  all  vertices  in  the  current  level  of  the  tree 
to  see  if  any  of  them  is  active.  If  so,  go  to  step  2. 
Otherwise,  all  cycles  surrounding  the  subject  vertex  have 
been  extracted.  Terminate  the  process. 

Only  cycles  which  contain  a valid  path  from  the  endpoint 
to  the  root  of  the  tree  will  be  added  to  the  cycle-list. 
The  cycles  surrounding  other  vertices  of  the  graph  can  be 
found  by  repeated  use  of  the  algorithm  for  each  vertex. 

Figure  3.7  will  present  the  details  of  how  Algorithm  3-2 
works  for  vertex  A of  the  graph  shown  in  Figure  3.6.  In  the 
cycle  tree  of  vertex  A as  shown  in  Figure  3.7,  the  paths 
with  the  marks  represent  the  identified  cycles,  the 
paths  with  the  'X'  marks  represent  the  paths  violating  the 
constraint,  and  the  paths  with  the  ' = ' marks  represent  the 
redundant  cycles  which  have  been  identified.  The  extracted 
cycles  surrounding  vertex  A are  (ABC) , (ABDC) , and  (ABEFC) . 

3.2.2  Surrounding  Cycle  Vector 

The  surrounding  cycle  vector  of  a vertex  is  the 
concatenation  of  all  cycles  surrounding  the  vertex.  For 
instance,  consider  vertex  A in  the  graph  of  Figure  3.6. 
There  are  three  cycles  surrounding  vertex  A;  they  are  (ABC) , 
(ABDC) , and  (ABEFC) . The  surrounding  cycle  vector  of  vertex 
A is  then  obtained  by  concatenating  the  cycles  surrounding 
vertex  A,  which  is  [ (ABC) , (ABDC) , (ABEFC) ] . 
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Figure  3 . 6 Graph  3.6. 
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Figure  3.7 


Cycle  Tree  for  Vertex  A in  Graph  3.6. 
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3.2.3  Cycle  Invariants 

The  cycle  invariants  are  properties  of  a vertex  which 
are  invariant  under  graph  isomorphism.  Four  cycle 
invariants  are  proposed.  They  are  (1)  the  surrounding  cycle 
number,  (2)  the  cycle  length  vector,  (3)  the  vertex  label 
cycle  vector,  and  (4)  the  edge  label  cycle  vector. 

3 . 2 . 3 . 1 . Surrounding  cycle  number 

The  surrounding  cycle  number  of  a vertex  is  the  number 
of  cycles  surrounding  the  vertex.  The  surrounding  cycle 
number  of  vertex  x can  be  obtained  by  counting  the  number  of 
cycles  in  the  surrounding  cycle  vector  of  vertex  x.  For 
instance,  referring  to  the  graph  in  Figure  3.6,  the 
surrounding  cycle  number  of  vertex  A is  3 , which  is  obtained 
by  counting  the  number  of  cycles  in  the  surrounding  cycle 
vector  of  vertex  A. 

The  surrounding  cycle  number  can  be  used  as  a vertex 
invariant  in  the  vertex  partitioning  for  the  graphs  under 
isomorphism.  Consider  two  vertices  Vj.  and  Vg  belonging  to 
graphs  and  respectively.  An  isomorphism  of  G^  onto  G^ 
can  map  Vj.  onto  Vg  only  if  the  number  of  cycles  surrounding 
V;^  equals  the  number  of  cycles  surrounding  Vg.  To 
illustrate  this,  let  us  consider  the  problem  of  partitioning 
the  vertices  of  graphs  G^  and  G^  in  Figure  3.8.  Graph  G^  is 
isomorphic  to  graph  G^.  Algorithm  3-2  produces  the  set  of 
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L2 

Figure  3.8  A Pair  of  Graphs  Illustrating  the  Usefulness 

of  Cycle  Invariants  in  the  Vertex  Partitioning 
Process . 
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cycles  surrounding  each  vertex  of  graph  from  which  the 
surrounding  cycle  vector  of  the  vertex  is  obtained.  Shown 
in  Table  3.13  is  a list  of  the  surrounding  cycle  vector  for 
each  vertex  in  the  graph.  A summary  of  the  surrounding 
cycle  numbers  for  the  vertices  in  graph  could  be  written 
and  shown  in  Table  3.14,  in  which  each  vertex  is  associated 
with  its  corresponding  surrounding  cycle  number. 


Table  3.13 


A 

G^ 


summary  of  Surrounding  Cycle  vector  for  Graph 
in  Figure  3.8. 


Vertex 

Surrounding  Cycle  Vector 

A 

[ (ABC) , (ABDC) , (ABFHEDC) , (ABFGHEDC) ] 

B 

[ (BAC) , (BCD) , (BACD) , (BDEHF) , (BCDEHF) , (BDEHGF) , 
(BACDEHF) , (BCDEHGF) , (BACDEHGF) ] 

C 

[ (CAB) , (CBD) , (CABD) , (CBFHED) , (CABFHED) , 
(CBFGHED) , (CABFGHED) ] 

D 

[ (DBC) , (DBAC) , (DBFHE) , (DBFGHE) , (DCBFHE) , 
(DCABFHE) , (DCBFGHE) , (DCABFGHE) ] 

E 

[ (EDBFH) , (EDBFGH) , (EDCBFH) , (EDCABFH) , (EDCBFGH) , 
(EDCABFGH) ] 

F 

[ (FGH) , (FBDEH) , (FBCDEH) , (FBDEHG) , (FBACDEH) , 
(FBCDEHG) , (FBACDEHG) ] 

G 

[ (GFH) , (GFBDEH) , (GFBCDEH) , (GFBACDEH) ] 

H 

[ (HFG) , (HEDBF) , (HEDBFG) , (HEDCBF) , (HEDCABF) , 
(HEDCBFG) , (HEDCABFG) ] 
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Table  3.14 


Vertex 

Surrounding  Cycle  Number 

A 

4 

B 

9 

C 

7 

D 

8 

E 

6 

F 

7 

G 

4 

H 

7 

If  the  same  analysis  is  done  for  graph  , the 
surrounding  cycle  vectors  and  surrounding  cycle  numbers  for 
the  vertices  in  graph  could  be  written  and  shown  in 
Tables  3.15  and  3.16. 


Table  3.15 


Vertex 

Surrounding  Cycle  Vector 

a 

[ (ade) , (abhcd) , (abghcd) , (abhcde) , (abfghcd) , 
(abghcde) , (abfghcde) ] 

b 

[ (bfg) , (bgh) , (bfgh) , (badch) , (badchg) , (baedch) , 
(badchgf ) , (baedchg) , (baedchgf) ] 

c 

[ (cdabh) , (cdabgh) , (cdeabh) , (cdabfgh) , (cdeabgh) , 
(cdeabfgh) ] 

d 

[ (dae) , (dabhc) , (dabghc) , (deabhc) , (dabfghc) , 
(deabghc) , (deabfghc) ] 

e 

[ (ead) , (eabhcd) , (eabghcd) , (eabfghcd) ] 

f 

[ (fbg) , (fbhg) , (f badchg) , (f baedchg) ] 

g 

[ (gbf) , (gbh) , (gfbh) , (gbadch) , (gbaedch) , (gfbadch) 
(gfbaedch) ] 

h 

[ (hbg) , (hbfg) , (hbadc) , (hbaedc) , (hcdabg) , 
(hcdabfg) , (hcdeabg) , (hcdeabfg) ] 
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Table  3.16 


A Summary  of  Surrounding  Cycle  Number  for  Graph 
in  Figure  3.8. 


Vertex 

Surrounding  Cycle  Number 

a 

7 

b 

9 

c 

6 

d 

7 

e 

4 

f 

4 

g 

7 

h 

8 

Partitioning  the  vertex  sets  of  graphs  G^  and  G^  in 
Figure  3 . 8 based  on  the  surrounding  cycle  number  of  each 
vertex,  the  generated  partitioning  result  is: 


(A,G) ,E, (C,F,H) ,D,B 
(e, f) ,c, (a,d,g) ,h,b 


3 . 2 . 3 . 2 . Cycle  length  vector 


The  length  of  a cycle  is  defined  as  the  number  of  edges 
contained  in  the  cycle.  This  is  of  course,  equal  to  the 
umber  of  vertices  contained  in  this  cycle.  For  each  vertex 
in  the  graph,  one  first  enumerates  cycles  of  different 
length  surrounding  the  vertex  and  constructs  a sequence  of 
cycle  length  numbers  for  the  vertex.  The  cycle  length 
vector  of  vertex  k in  graph  G is  defined  as  vc(k)  = 
(n3  , n4  , n5 , • • • , nj  ) , where  nj^  equals  the  number  of  cycles  of 
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length  i in  which  vertex  k is  contained  and  j is  the  longest 
cycle  length  in  the  surrounding  cycle  vectors  of  all 
vertices  in  graph  G.  Enumeration  of  the  cycle  length  vector 
of  a vertex  is  straightforward:  one  simply  considers  each 
vertex  separately  and  forms  a sequence  of  numbers,  the  first 
element  giving  the  number  of  length  3 cycles  surrounding  the 
vertex  (since  the  minimum  length  of  a cycle  is  three) , the 
second  element  giving  the  number  of  length  4 cycles 
surrounding  the  vertex,  and  so  on.  For  instance,  consider 
vertex  C in  graph  of  Figure  3.8.  There  are  seven  cycles 
surrounding  vertex  C.  They  are  (CAB) , (CBD) , (CABD) , 
(CBFHED) , (CABFHED) , (CBFGHED) , and  (CABFGHED) , with  length 
3,  3,  4,  6,  7,  7,  and  8,  respectively.  Since  the  longest 
cycle  length  in  the  surrounding  cycle  vectors  of  all 
vertices  in  the  graph  is  8,  the  cycle  length  vector  of 
vertex  C is  (n3,n4,n5,n5,n7,n8) , the  first  element  giving 
the  number  of  length  3 cycles  surrounding  vertex  C,  the 
second  element  giving  the  number  of  length  4 cycles 
surrounding  vertex  C,  and  so  on.  There  are  2 cycles  with 
length  3,  1 cycle  with  length  4,  1 cycle  with  length  6,  2 
cycles  with  length  7 , and  1 cycle  with  length  8 surrounding 
vertex  C.  Therefore,  the  cycle  length  vector  of  vertex  C is 
(2, 1,0, 1,2,1)  . 

The  cycle  length  vector  can  be  used  as  a vertex 
invariant  in  the  vertex  partitioning  for  the  graphs  under 
isomorphism.  Consider  two  vertices  Vj-  and  Vg  belonging  to 
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graphs  and  respectively.  An  isomorphism  of  G^  onto  G^ 
can  map  Vj.  onto  Vg  only  if  the  cycle  length  vector  of  v^ 
equals  the  cycle  length  vector  of  Vg.  To  illustrate  this, 
let  us  consider  the  problem  of  partitioning  the  vertices  of 
graphs  G^  and  G^  in  Figure  3.8.  Graph  G^  is  isomorphic  to 
graph  G^.  A summary  of  the  cycle  length  vectors  for  the 
vertices  in  graph  G^  could  be  written  and  shown  in  Table 
3.17,  in  which  each  vertex  is  associated  with  its 
corresponding  cycle  length  vector. 

Table  3.17  A Summary  of  Cycle  Length  Vector  for  Graph  G^ 
in  Figure  3.8. 


Vertex 

Cycle  Length  Vector 

A 

(1,1, 0,0, 1,1) 

B 

(2, 1,1, 2, 2,1) 

C 

(2, 1,0, 1,2,1) 

D 

(1,1, 1,2, 2,1) 

E 

(0,0, 1,2, 2,1) 

F 

(1,0, 1,2, 2,1) 

G 

(1,0, 0,1, 1,1) 

H 

(1,0, 1,2, 2,1) 

If  the  same  analysis  is  done  for  graph  G^ , a summary  of 
the  cycle  length  vectors  for  the  vertices  in  graph  G^  could 
be  written  and  shown  in  Table  3.18. 
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Table  3.18  A Summary  of  Cycle  Length  Vector  for  Graph 
in  Figure  3.8. 


Vertex 

Cycle  Length  Vector 

a 

(1,0, 1,2, 2,1) 

b 

(2, 1,1, 2, 2,1) 

c 

(0,0, 1,2, 2,1) 

d 

(1,0, 1,2, 2,1) 

e 

(1,0, 0,1, 1,1) 

f 

(1,1, 0,0, 1,1) 

g 

(2, 1,0, 1,2,1) 

h 

(1,1, 1,2, 2,1) 

Partitioning  the  vertex  sets  of  graphs  G^  and  G^  by 
considering  each  cycle  length  vector  as  a single  decimal 
number,  the  generated  partitioning  result  is: 

E,G, (F,H) ,A,D,C,B 
c,e, (a,d) ,f,h,g,b 

3 . 2 . 3 . 3 . Vertex  label  cycle  vector 

The  vertex  label  cycle  vector  of  a subject  vertex  is 
obtained  by  substituting  the  vertex  label  of  each  vertex  to 
the  surrounding  cycle  vector  of  the  subject  vertex,  then 
enumerating  the  vertex  label  distribution  for  each  cycle  in 
the  surrounding  cycle  vector  and  reordering  the  generated 
numeric  sequence  in  ascending  order  if  we  treat  each  vertex 
label  distribution  as  a single  decimal  number.  In  the 
vertex  label  cycle  vector  of  vertex  x,  the  first  element 
giving  the  vertex  label  distribution  of  the  first  cycle 
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surrounding  vertex  x,  the  second  element  giving  the  vertex 
label  distribution  of  the  second  cycle  surrounding  vertex  x, 
and  so  on.  The  length  of  each  element  in  the  vertex  label 
cycle  vector  is  equal  to  the  number  of  vertex  labels  in  the 
graph.  For  instance,  consider  vertex  C in  graph  of 

Figure  3.8.  The  surrounding  cycle  vector  of  vertex  C is 
[ (CAB)  , (CBD)  , (CABD)  , (CBFHED)  , (CABFHED)  , (CBFGHED)  , 
(CABFGHED) ] . Substituting  the  vertex  label  of  each  vertex 
into  the  surrounding  cycle  vector  of  vertex  C,  the  result 
generated  is  [ (L2L1L3)  , (L2L3L2)  , (L2L]^L3L2)  , (L2L3L2L2L1L2)  , 
( L2  L1L3  L2  L2  L1L2  ) / ( L2L3L2L3L2L3L2  ) , (L2L3L3L2Lq^L2L3L2  ) ] • 
Since  the  number  of  vertex  labels  in  the  graph  is  3,  the 
length  of  each  element  in  the  vertex  label  cycle  vector  is 
3.  By  enumerating  the  vertex  label  distribution  of  each 
cycle  and  reordering  the  generated  numeric  seguence,  the 
vertex  label  cycle  vector  of  vertex  C is  then  obtained, 
which  is  [ (021) , (111) , (121) , (141) , (241) , (241) , (341) ] . 

The  vertex  label  cycle  vector  can  be  used  as  a vertex 
invariant  in  the  vertex  partitioning  for  the  graphs  under 
isomorphism.  Consider  two  vertices  Vj-  and  Vg  belonging  to 
graphs  G^  and  G^  respectively.  An  isomorphism  of  G^  onto  G^ 
can  map  Vj-  onto  Vg  only  if  the  vertex  label  cycle  vector  of 
Vj.  eguals  the  vertex  label  cycle  vector  of  Vg.  To 

illustrate  this,  let  us  consider  the  problem  of  partitioning 
the  vertices  of  graphs  G^  and  G^  in  Figure  3.8.  Graph  G^  is 
isomorphic  to  graph  G^.  A summary  of  the  vertex  label  cycle 


109 


vectors  for  the  vertices  in  graph  could  be  written  and 
shown  in  Table  3.19,  in  which  each  vertex  is  associated  with 
its  corresponding  vertex  label  cycle  vector. 


Table  3.19 


A Summary  of  Vertex  Label  Cycle  Vector  for  Graph 
G^  in  Figure  3.8. 


Vertex 

Vertex  Label  Cycle  Vector 

A 

[ (111) , (121) , (241) , (341) ] 

B 

[ (021) , (111) , (121) , (131) , (141) , (231) , (241) , 
(241) , (341) ] 

C 

[ (021) , (111) , (121) , (141) , (241) , (241) , (341) ] 

D 

[ (021) , (121) , (131) , (141) , (231) , (241) , (241) , 
(341)] 

E 

[ (131) , (141) , (231) , (241) , (241)  , (341) ] 

F 

[ (120) , (131) , (141) , (231) , (241) , (241) , (341) ] 

G 

[ (120) , (231) , (241) , (341) ] 

H 

[ (120) , (131) , (141) , (231) , (241) , (241) , (341) ] 

If  the 
the  vertex 


same  analysis  is  done  for  graph  G^ , a summary  of 
label  cycle  vectors  for  the  vertices  in  graph  G^ 


could  be  written  and  shown  in  Table  3.20. 
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Table  3.20 


A Summary  of  Vertex  Label  Cycle  Vector  for  Graph 
in  Figure  3.8. 


Vertex 

Vertex  Label  Cycle  Vector 

a 

[(120) , (131) , (141) , (231) , (241) , (241) , (341) ] 

b 

[ (021) , (111) , (121) , (131) , (141) , (231) , (241) , 
(241)  , (341) ] 

c 

[ (131) , (141) , (231) , (241) , (241) , (341) ] 

d 

[(120) , (131) , (141) , (231) , (241) , (241) , (341) ] 

e 

[(120) , (231) , (241) , (341) ] 

f 

[(111) , (121) , (241) , (341) ] 

g 

[ (021)  , (111)  , (121)  , (141)  , (241)  , (241)  , (341) ] 

h 

[(021) , (121) , (131) , (141) , (231) , (241) , (241) , 
(341) ] 

Partitioning  the  vertex  sets  of  graphs  G^  and  G^  in 
Figure  3.8  by  considering  each  vertex  label  cycle  vector  as 
a single  decimal  number,  the  generated  partitioning  result 
is: 


A,G,E,C, (F,H) ,D,B 
f ,e,c,g, (a,d) ,h,b 


3. 2. 3. 4.  Edge  label  cycle  vector 


The  edge  label  cycle  vector  of  a subject  vertex  is 
obtained  by  substituting  the  edge  label  of  each  edge  to 
every  cycle  in  the  surrounding  cycle  vector  of  the  vertex, 
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then  enumerating  the  edge  label  distribution  for  each  cycle 
in  the  surrounding  cycle  vector  and  reordering  the  generated 
numeric  sequence  in  ascending  order  if  we  treat  each  edge 
label  distribution  as  a single  decimal  number.  In  the  edge 
label  cycle  vector  of  vertex  x,  the  first  element  giving  the 
edge  label  distribution  of  the  first  cycle  surrounding 
vertex  x,  the  second  element  giving  the  edge  label 
distribution  of  the  second  cycle  surrounding  vertex  x,  and 
so  on.  The  length  of  each  element  in  the  edge  label  cycle 
vector  is  equal  to  the  number  of  edge  labels  in  the  graph. 
For  instance,  consider  vertex  A in  graph  of  Figure  3.8. 
The  surrounding  cycle  vector  of  vertex  A is  [ (ABC) , (ABDC) , 
(ABFHEDC) , (ABFGHEDC) ] . Since  the  number  of  edge  labels  in 
the  graph  is  3 , the  length  of  each  element  in  the  edge  label 
cycle  vector  should  be  3.  The  first  cycle  in  the 
surrounding  cycle  vector  of  vertex  A is  (ABC) , the  edge 
label  for  each  pair  of  vertices  in  the  cycle  are  as  follows; 
AB  is  I3,  BC  is  1-^,  and  CA  is  I3.  Thus  the  edge  label 
distribution  of  the  first  cycle  is  (102) . The  second  cycle 
in  the  surrounding  cycle  vector  of  vertex  A is  (ABDC) , the 
edge  label  for  each  pair  of  vertices  in  the  cycle  are  as 
follows;  AB  is  I3,  BD  is  I2,  DC  is  I2,  and  CA  is  I3.  Thus 
the  edge  label  distribution  of  the  second  cycle  is  (022) . 
The  third  cycle  in  the  surrounding  cycle  vector  of  vertex  A 
is  (ABFHEDC)  , the  edge  label  for  each  pair  of  vertices  in 
the  cycle  are  as  follows;  AB  is  I3,  BF  is  I3,  FH  is  I2,  HE 


112 


is  I3,  ED  is  I2,  DC  is  I2,  and  CA  is  I3.  Thus  the  edge 
label  distribution  of  the  third  cycle  is  (133) . The  fourth 
cycle  in  the  surrounding  cycle  vector  of  vertex  A is 
(ABFGHEDC) , the  edge  label  for  each  pair  of  vertices  in  the 
cycle  are  as  follows;  AB  is  I3,  BE  is  1^,  EG  is  I2,  GH  is 
I2/  HE  is  I3,  ED  is  I2,  DC  is  I2 , and  CA  is  I3.  Thus  the 
edge  label  distribution  of  the  fourth  cycle  is  (143) . If  we 
consider  the  edge  label  distribution  of  each  cycle  as  a 
single  decimal  number  and  reorder  the  generated  numeric 
seguence,  the  edge  label  cycle  vector  of  vertex  A is  then 
obtained,  which  is  [ (022) , (102) , (133) , (143) ] . 

The  edge  label  cycle  vector  can  be  used  as  a vertex 
invariant  in  the  vertex  partitioning  for  the  graphs  under 
isomorphism.  Consider  two  vertices  V;^  and  Vg  belonging  to 
graphs  G^  and  G^  respectively.  An  isomorphism  of  G^  onto  G^ 
can  map  Vj-  onto  Vg  only  if  the  edge  label  cycle  vector  of  v^. 
equals  the  edge  label  cycle  vector  of  Vg.  To  illustrate 
this,  let  us  consider  the  problem  of  partitioning  the 
vertices  of  graphs  G^  and  G^  in  Figure  3.8.  Graph  G^  is 
isomorphic  to  graph  G^.  A summary  of  the  edge  label  cycle 
vectors  for  the  vertices  in  graph  G^  could  be  written  and 
shown  in  Table  3.21,  in  which  each  vertex  is  associated 
with  its  corresponding  edge  label  cycle  vector. 
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Table  3.21 


A Summary  of  Edge  Label  Cycle  Vector  for  Graph 
in  Figure  3.8. 


Vertex 

Edge  Label  Cycle  Vector 

A 

[(022)  , (102)  , (133) , (143) ] 

B 

[ (022)  , (102)  , (120)  , (131)  , (133)  , (141)  , (143) , 
(231) , (241) ] 

C 

[ (022) , (102)  , (120) , (133)  , (143)  , (231) , (241) ] 

D 

[ (022)  , (120)  , (131) , (133)  , (141)  , (143)  , (231) , 
(241) 3 

E 

[(131) , (133) , (141) , (143) , (231) , (241) ] 

F 

[ (030) , (131)  , (133)  , (141)  , (143)  , (231)  , (241) ] 

G 

[ (030) , (141) , (143) , (241) ] 

H 

[ (030) , (131)  , (133)  , (141) , (143) , (231)  , (241) ] 

If  the  same  analysis  is  done  for  graph  G^ , a summary  of 
the  edge  label  cycle  vectors  for  the  vertices  in  graph  G^ 
could  be  written  and  shown  in  Table  3.22. 

Partition  the  vertex  sets  of  graphs  G^  and  G^  in  Figure 
3.8  by  considering  each  edge  label  cycle  vector  as  a single 
decimal  number,  the  generated  partitioning  result  is: 


A,G,E,C, (F,H) ,D,B 
f ,e,c,g, (a,d) ,h,b 
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Table  3.22 


A Summary  of  Edge  Label  Cycle  Vector  for  Graph 
in  Figure  3.8. 


Vertex 

Edge  Label  Cycle  Vector 

a 

[ (030) , (131) , (133) , (141) , (143) , (231) , (241) ] 

b 

[ (022) , (102) , (120) , (131) , (133) , (141) , (143) , 
(231) , (241) ] 

c 

[(131) , (133) , (141) , (143) , (231) , (241) ] 

d 

[ (030) , (131) , (133) , (141) , (143) , (231) , (241) ] 

e 

[(030),  (141),  (143),  (241)] 

f 

[ (022)  , (102)  , (133) , (143) ] 

g 

[ (022) , (102)  , (120) , (133)  , (143)  , (231)  , (241) ] 

h 

[(022) , (120)  , (131) , (133)  , (141)  , (143)  , (231)  , 
(241) ] 

3.2.4  Vertex  Partitioning  Using  Cycle  Invariants 

The  cycle  invariants  derived  in  this  section  are 
invariant  under  graph  isomorphism.  Therefore,  they  can  be 
used  to  partition  the  vertex  sets  of  a pair  of  cyclically 
involved  graphs  to  reduce  the  number  of  possible  vertex 
mappings. 


CHAPTER  IV 

INVARIANT  GENERATION  BASED  ON  CANONICAL  ENCODING 

A canonical  encoding  algorithm  for  the  generation  of 
canonical  representation  for  a graph  is  presented  in  this 
chapter.  The  canonical  encoding  problem  is  to  design  such  a 
scheme  which  will  apply  to  any  graph  so  that  the  rules  give 
a unique  canonical  representation.  The  canonical  code 
which  is  a sufficient  condition  for  the  existence  of  an 
isomorphism  and  the  canonical  string  which  is  a necessary 
condition  for  the  existence  of  an  isomorphism  are  then 
derived  based  on  the  encoded  canonical  representation.  The 
canonical  code  and  canonical  string  of  the  unknown  graph  is 
them  matched  against  the  canonical  codes  and  canonical 
strings  of  all  known  graphs.  If  any  two  graphs  have 

identical  canonical  codes,  then  we  can  conclude  that  the  two 
graphs  are  isomorphic  the  vertex  correspondence  between  the 
vertex  sets  of  these  two  graphs  are  obtained  by  pairing  the 
final  canonical  orderings  of  the  two  graphs.  As  to  the 
canonical  string,  it  can  be  used  as  an  invariant  in  the 
isomorphism  test  to  eliminate  those  non-isomorphic  graphs  at 
early  steps. 

The  underlying  idea  behind  the  canonical  encoding 
technique  to  determine  whether  or  not  two  graphs  G^  and  G^ 
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are  isomorphic  is  by  keeping  fixed  and  reordering 
vertices  of  to  check  if  their  adjacency  matrices  become 
identical . Permutations  of  rows  and  corresponding  columns 
imply  reordering  the  vertices.  It  must  be  noted,  however, 
that  the  rows  and  columns  must  be  arranged  in  the  same 
order.  Thus  if  two  rows  are  interchanged  in  the  adjacency 
matrix,  the  corresponding  columns  must  also  be  interchanged. 
Hence  two  graphs  G^  and  G^  are  isomorphic  if  their  adjacency 
matrices  differ  only  by  permutations  of  rows  and  columns. 

The  basic  approach  of  the  canonical  encoding  technigue 
is  to  represent  an  n-vertex  graph  by  an  n x n adjacency 
matrix.  Each  row  is  taken  as  a decimal  number.  The  rows 
and  columns  of  this  matrix  are  then  rearranged  until  the 
rows  are  sorted  in  ascending  order.  This  is  done  by 
starting  from  the  top  of  the  matrix  and  exchanging  each  row 
with  the  one  below  it,  if  it  has  a larger  numerical  value. 
When  two  rows  are  exchanged,  the  corresponding  columns  are 
also  exchanged.  This  is  done  until  no  more  exchanges  take 
place. 

Several  authors  (Nagle  [1966],  Harary  [1974],  Shah, 
Davida,  and  McCarthy  [1974],  Randic  [1974])  have 
independently  discovered  a method  for  associating  a unique 
number  with  a given  graph  G,  which  characterizes  G.  The 
device  which  accomplishes  the  encoding  is  the  assignment  of 
a unique  labeling  to  the  vertices  of  G so  that  the  resulting 
adjacency  matrix  A is  in  a certain  canonical  form.  The 
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first  alternative  procedure  suggested  that  by  pairwise 
permutations  of  rows  and  columns  one  will  arrive  at  the 
absolute  minimum,  but  as  we  know  now,  the  search  can  be 
locked  in  a local  minimum  (Mackay  [1975]).  The  second 
procedure  starts  with  a completely  unlabeled  graph  (in 
contrast  to  the  first  approach  when  one  assumes  an 
arbitrary  labeling  as  initial)  . Then,  in  order  to  have  in 
the  first  row  as  small  a number  as  the  situation  allows,  one 
assigns  the  smallest  labels  to  vertices  of  the  lowest 
degree,  as  their  adjacent  labels  combine  the  largest 
available  labels.  The  process  is  carried  out  stepwise,  one 
label  at  a time.  Since  the  already  assigned  labels  may 
dictate  the  future  assignments,  this  particular  algorithm 
appears  to  be  less  suited  for  computer  implementation, 
certainly  less  simple  than  the  former. 

Most  of  the  earlier  works  use  the  simple  adjacency 
matrix  which  gives  no  indication  of  the  vertex  and  edge 
features  that  are  present  in  the  graph.  It  is  also  not  easy 
to  constrain  the  row  and  column  exchanges.  According  to 
Mackay  [1975]  these  methods  will  sometimes  fail  to  yield  an 
absolute  minimum  for  a given  graph  configuration.  It  is 
also  possible  for  the  process  to  enter  a cycle.  These  two 
conditions  are  due  to  the  fact  that  an  exchange  of  columns 
following  an  exchange  of  rows  might  require  that  rows 
already  arranged  be  rearranged;  i.e.,  exchanging  two  columns 


118 


can  change  the  index  number  (if  we  consider  each  row  of  the 
matrix  as  a single  decimal  number)  of  the  rows  involved. 

The  canonical  encoding  algorithm  presented  in  this 
chapter  attempts  to  overcome  the  problems  associated  with 
the  basic  approach.  Ordering  the  rows  by  the  number  of 
nonzero  entries  rather  than  by  the  size  of  the  index  number 
will  result  in  an  algorithm  that  always  terminates  because 
column  exchanges  following  row  exchanges  cannot  affect  the 
ordering  of  the  rows  that  have  already  arranged;  i.e.,  the 
number  of  nonzero  entries  in  every  row  will  always  remain 
the  same . 

Since  the  vertex  features  and  edge  features  uniquely 
specify  the  local  properties  of  the  vertices  and  the 
relationships  among  the  vertices  in  the  graph,  they  can  be 
utilized  to  select  which  row  should  precede  the  other  in  the 
rearrangement  process.  While  previous  approaches  do  not 
reflect  the  vertex  features  and  edge  features  in  the 
adjacency  matrix,  our  approach  employs  the  vertex  and  edge 
features  in  the  generated  modified  adjacency  matrix.  It  is 
thus  possible  to  more  easily  discriminate  between  any  two 
vertices. 


4.1  Modified  Adjacency  Matrix 


A graph  can  be  described  in  terms  of  its  primitive  parts 
(i.e.,  the  vertex  labels)  and  the  relationships  (i.e.,  the 
edge  labels)  between  them.  The  vertex  label  is  the  vertex 
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feature  associated  with  each  vertex  in  a graph  and  the  edge 
label  is  the  edge  feature  associated  with  each  edge  in  a 
graph.  In  this  section,  we  will  present  a matrix  called 
modified  adjacency  matrix  for  the  representation  of  a graph. 
This  matrix  requires  that  all  the  possible  vertex  labels 
that  could  be  present  be  identified,  along  with  all  the 
possible  edge  labels  that  could  connect  any  two  vertices  be 
identified.  In  this  representation,  attached  to  each  vertex 
is  a vertex  label  drawn  from  a set  of  vertex  features  (such 
as  degree,  vertex  type) . These  vertex  labels  represent  the 
primitives  of  the  graph  being  represented.  Attached  to  each 
edge  is  also  an  edge  label  that  is  drawn  from  a set  of  edge 
features  (such  as  edge  type) . For  example,  the  graph  of 
Figure  4.1  can  be  described  as  follows: 

VLa  = L2 
VLb  = L3 
VLc  = L2 
VLd  = Li 
VLe  = L3 
= ^2 
elac  = I2 
ELaD  = ll 
= ^2 
ELcd  = ll 
ELce  = ll 
“ ^2 
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Vertex 

Vertex  Label 

A 

L2 

B 

L3 

C 

L2 

D 

Ll 

E 

L3 

Figure  4.1  Graph  4.1. 
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where  denotes  the  vertex  label  of  vertex  i and  ELj^j 

denotes  the  edge  label  of  edge  ij . The  vertex  label  and 
edge  label  vertices  constitute  the  set  of  label  vertices. 
The  vertices  of  the  graph  formed  by  the  label  vertices  and 
non-label  vertices  are  L2,  L3 , 1^,  I2,  A,  B,  C,  D,  and 

E,  where  the  first  five  are  the  label  vertices  and  the  last 
five  are  the  non-label  vertices. 

The  description  of  a graph  is  input  in  the  form  shown 
above,  in  which  each  input  line  represents  a vertex  in  the 
representative  graph.  The  elements  in  the  row 
corresponding  to  each  vertex  are  ordered  by  vertex  labels 
and  edge  labels  and  then  by  connectivity.  The  description 
as  given  is  then  reduced  into  the  modified  adjacency  matrix. 
The  modified  adjacency  matrix  for  an  n-vertex  graph  is  the 
(n+ri+r2)  x (n+r3^+r2)  matrix  as  illustrated  in  Table  4.1, 
where  r^  is  the  number  of  vertex  labels,  and  r2  is  the 
number  of  edge  labels.  To  ensure  that  the  modified 
adjacency  matrix  is  in  some  initial  order,  the  description 
is  sorted,  first  by  the  label  vertices  and  then  by  the  non- 
label vertices.  In  this  matrix  the  vertex  label  vertices 
and  edge  label  vertices  are  sorted  in  descending  numerical 
or  alphabetic  order.  For  any  vertex,  vertex  u,  the  element 
in  the  vertex  label  sectors  a(rl+r2+u)p  = 1 if  the  vertex 
label  of  vertex  u is  Lp,  otherwise  a(rl+r2+u)p  “ 
element  in  the  edge  label  sectors  a (rl+r2+u)  (rl+q)  ” dlqC’^)  / 
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Table  4.1  Modified  Adjacency  Matrix. 


Li  1 

L2  2 


Lri  ri 

11  ri+1 

12  ri+2 


^r2  ^1+^2 


vi 

V2 


'n 


ri+r2+l 

ri+r2+2 


ri+r2+n 


Table  4.2  Modified  Adjacency  Matrix  for  Graph  4.1. 


Ll  L2  L3 
12  3 


ll  I2  A 
4 5 6 


B C 

7 8 


D E 
9 10 


A 6 


B 7 
C 8 
D 9 
E 10 


0 0 
0 0 
0 0 
0 0 
0 0 
0 1 
0 0 
0 1 
1 0 
0 0 


0 0 
0 0 
0 0 
0 0 
0 0 
0 1 
1 0 
0 2 
0 2 
1 1 


0 0 
0 0 
0 0 
0 0 
0 0 
2 0 
2 1 
1 1 
2 1 
1 0 


0 0 
0 0 
0 0 
0 0 
0 0 
1 1 
0 0 
0 0 
1 1 
0 1 


0 0 
0 0 
0 0 
0 0 
0 0 
1 0 
1 0 
1 1 
0 1 
1 0 
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which  is  the  number  of  edges  with  edge  label  Ig  that  are 
incident  with  vertex  u.  Thus,  the  description  for  the  graph 
in  Figure  4.1  will  be  represented  by  the  modified  adjacency 
matrix  as  shown  in  Table  4.2  with  the  initial  vertex 
ordering  A,B,C,D,E. 

4 . 2 Canonical  Encoding  Algorithm 

In  principle,  the  rows  of  the  modified  adjacency  matrix 
are  considered  as  a series  of  numbers.  The  aim  is  to 
reorder  rows  and  columns  until  the  matrix  from  top  to  bottom 
represents  the  rows  in  ascending  order  if  each  row  is 
considered  as  being  a number.  The  canonical  encoding 
problem  is  then  formulated  as  the  matrix  minimization 
problem.  It  is  performed  by  attempting  to  move  down  rows 
with  the  largest  number  of  nonzero  entries  while  moving  up 
those  with  the  least  number  of  nonzero  entries.  To  ensure 
that  the  matrix  minimization  process  will  always  terminate, 
we  consider  the  row  with  the  largest  number  of  nonzero 
entries  to  be  the  largest.  Vertices  with  the  largest 
numbers  of  nonzero  entries  should  have  as  many  edges  as 
possible  to  ensure  that  the  corresponding  rows  will  have  as 
many  nonzero  entries  as  possible.  Clearly  the  above 
reguirement  on  the  largest  number  of  nonzero  entries  in  the 
last  row  implies  that  the  last  row  can  only  be  associated 
with  the  vertices  of  the  largest  degree  possible.  The 
process  is  carried  out  in  stages  until  no  more  movements  are 
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possible.  When  two  rows  are  exchanged,  the  corresponding 
columns  are  also  exchanged. 

It  should  be  pointed  out  at  this  stage  that  this  matrix 
minimization  process  will  always  terminate  and  that  it  will 
always  generate  a unique  canonical  matrix  for  every  graph. 
This  is  obvious  for  the  case  where  every  vertex  has  a 
different  number  of  connections  from  all  the  other  vertices. 

The  more  common  situation,  however,  is  that  there  is 
more  than  one  vertex  with  the  same  number  of  connections. 
The  first  constraint  that  is  applied  in  such  a case  is  that 
of  the  vertex  label.  It  is  applied  to  discriminate  between 
vertices  with  the  same  number  of  connections.  The  row  with 
the  larger  numeric  value  is  treated  as  the  larger  row. 

The  second  constraint  that  is  applied  is  that  of  the 
edge  label.  This  constraint  is  used  to  discriminate  between 
vertices  with  the  same  number  of  connections  and  the  same 
vertex  label.  The  row  with  the  larger  numeric  is  treated  as 
the  larger  row. 

When  two  vertices  both  have  the  same  number  of 
connections  and  the  same  vertex  label  and  edge  label,  then 
the  current  largest  row  is  selected.  In  this  case,  it  does 
not  really  matter  which  one  is  chosen  since  the  vertices  are 
identical . 

The  process  is  now  repeated  considering  the  rest  of  the 
graph  and  the  remaining  exchanges  in  the  same  manner,  the 
next  exchange  is  to  exchange  the  pair  of  rows  so  that  the 
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second  to  last  row  of  the  modified  adjacency  matrix,  i.e., 
n+ri+r2“l,  is  as  large  as  possible.  Hence,  one  ignores  the 
already  assigned  vertex  (the  last  row)  and  examines  the 
remainder  of  the  vertices.  With  the  completion  of  the 
assignment  of  the  second  exchange,  one  proceeds  in  the  same 
way  until  no  more  exchanges  occur. 

In  this  manner,  a row  is  selected  to  exchange  during 
each  pass  of  the  matrix  until  there  is  no  longer  any  need 
for  exchange.  Each  time  a row  that  is  selected  and  put  in 
the  appropriate  place  in  the  reordered  matrix  is  no  longer 
exchanged.  Hence,  the  process  will  always  terminate.  This 
method  is  therefore  different  from  those  that  attempt  the 
minimization  without  the  constraints  that  we  have  applied. 

The  proposed  matrix  minimization  algorithm  can  be 
stated  as  follows; 

Algorithm  4-1 

Step  1:  Select  the  row  with  the  largest  number  of  nonzero 
entries  and  exchange  it  with  the  last  row.  If  two  or  more 
than  two  rows  with  the  same  number  of  nonzero  entries, 
choose  the  one  with  the  largest  value.  When  two  rows  are 
exchanged,  the  corresponding  columns  are  also  exchanged. 

Step  2:  Consider  the  rest  of  the  graph  and  the  remaining 
rows  in  the  same  manner,  the  next  exchange  is  to  exchange 
the  pair  of  rows  so  that  the  next  row  of  the  modified 
adjacency  matrix  preceding  to  the  most  recently  exchanged 
row  is  as  large  as  possible. 
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Step  3:  With  the  completion  of  the  assignment  of  the  next 
exchange,  one  proceeds  in  the  same  way  until  no  more 
exchanges  take  place. 

The  matrix  generated  by  the  matrix  minimization 
algorithm  is  called  the  canonical  matrix  of  the  graph. 

Notice  that  the  rows  representing  the  label  vertices  are 
not  included  in  the  minimization  process.  These  label 
vertices  remain  invariant  throughout  the  minimization 
process  as  they  are  not  reordered.  The  label  vertices  are 
used  to  constrain  the  reordering  process.  When  two  rows 
contain  the  same  number  of  nonzero  entries,  selecting  the 
one  with  the  larger  value  is,  in  effect,  using  the  vertex 
label  and  edge  label  to  discriminate  between  the  two.  This 
ensures  that  the  resulting  canonical  matrix  is  in  the 
canonical  form. 

To  illustrate  the  formation  of  the  canonical  matrix,  let 
us  consider  the  graph  in  Figure  4.1  with  the  modified 
adjacency  matrix  shown  in  Table  4.2.  In  this  initial 
matrix,  the  row  with  the  largest  number  of  nonzero  entries 
is  row  9.  Thus,  rows  (and  columns)  9 and  10  are  exchanged, 
and  the  new  matrix  is  shown  in  Table  4.3. 

At  the  next  iteration,  only  the  first  nine  rows  are 
examined.  Of  these,  row  6 and  row  8 both  contain  6 nonzero 
entries.  Row  8 is  selected  for  exchange  with  row  9 as  it 
contains  the  larger  value.  It  is  exchanged  with  row  9. 
The  new  matrix  is  shown  in  Table  4.4. 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 


127 


Table  4 . 3 


1 2 


3 4 


5 6 7 8 9 10 


0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 10  12 
0 0 10  2 
0 10  2 1 
0 0 111 
1 0 0 2 2 


0 

0 

0 

0 

0 

0 

1 

1 

0 

1 


0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
110  1 
0 0 0 1 
0 0 11 
0 10  1 
1110 


Table  4.4 


1 2 3 4 5 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 10  12 

0 0 10  2 

0 0 111 

0 10  2 1 

1 0 0 2 2 


6 7 8 9 10 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 10  11 
1 0 0 0 1 

0 0 0 1 1 

10  10  1 
11110 


128 


In  this  matrix,  the  first  eight  rows  are  now  examined. 
The  row  with  the  largest  number  of  nonzero  entries  is  row  6. 
Thus  row  6 is  selected  for  exchange  with  row  8 . The  new 
matrix  is  shown  in  Table  4.5. 

Row  6 and  7 are  now  exchanged  based  on  the  above 
criterion.  The  new  matrix  is  shown  in  Table  4.6. 

At  this  stage  no  exchanges  are  possible.  This  is  the 
canonical  matrix  of  the  graph  which  has  been  transformed 
into  the  canonical  form.  The  canonical  ordering  of  the 
vertices  is  the  final  vertex  ordering  in  the  canonical 
matrix,  which  is  B,E,A,C,D. 

4.3  Complexity  Analysis  of  Canonical  Encoding  Algorithm 
In  the  matrix  minimization  process,  each  time  a row  that 


is  selected  and 

put 

in  the  appropriate 

place  in 

the 

reordered  matrix 

is 

no  longer  exchanged. 

Hence, 

the 

canonical  encoding  algorithm  will  always  terminate. 

Assume  the  number  of  vertices  of  the  graph  is  n.  The 
problem  of  row  and  column  permutation  is  of  the  same  nature 
as  the  sorting  problem.  At  the  first  iteration,  the 
algorithm  requires  (n-1)  comparisons.  At  the  second 
iteration,  the  algorithm  requires  (n-2)  comparisons. 
Therefore,  the  algorithm  should  requires  n*(n-l)/2 
comparisons  and  thus  should  take  a time  roughly  proportional 
to  n^  (Donovan  [1979]). 
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Table  4.5 


12  3 4 


5 6 7 8 9 10 


0 

0 

0 

0 

0 

0 

0 

0 

0 

1 


0 

0 

0 

0 

0 

0 

0 

1 

1 

0 


0 

0 

0 

0 

0 

1 

1 

0 

0 

0 


0 

0 

0 

0 

0 

1 

0 

1 

2 
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0 

0 

0 

0 

0 

1 

2 

2 

1 
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0 

0 

0 

0 

0 

0 

0 

0 
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0 

0 

0 

0 

0 

0 

0 

1 

0 

1 


0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

oil 
10  1 
oil 
10  1 
110 


Table  4.6 


1 2 3 4 5 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 10  2 

0 0 111 

0 10  12 

0 10  2 1 

1 0 0 2 2 


6 7 8 9 10 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 10  1 
0 0 0 1 1 

10  0 11 

0 110  1 

11110 
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4.4  Canonical  Code 

A decimal  number  sequence  called  the  canonical  code  can 
be  extracted  from  the  canonical  matrix  which  determines  the 
canonical  matrix  uniquely  and  hence  also  determines  the 
graph.  The  canonical  matrix  is  scanned  to  turn  the  matrix 
into  an  ordered  numeric  string.  This  is  done  by  assigning 
each  row  a decimal  number  and  then  doing  a raster  scan  on 
the  matrix.  In  fact  it  can  be  thought  of  as  working  with 
the  n decimal  numbers  which  represent  the  n rows.  Each  of 
the  n!  different  numbering  of  the  graph  thus  gives  a 
different  code.  It  is  proposed  to  make  that  numbering  which 
gives  the  smallest  code  to  be  the  canonical  code  of  the 
graph . 

Referring  to  the  graph  in  Figure  4.1,  its  canonical  code 
is  (10200101,11100011,101210011,102101101,1002211110),  which 
is  extracted  from  the  canonical  matrix  as  shown  in  Table 

4.6. 

The  canonical  encoding  algorithm  allows  all  the  known 
graphs  to  be  compiled  into  the  canonical  code  only  once. 
Subsequently,  only  the  unknown  input  graph  need  be  compiled 
into  its  canonical  code.  This  canonical  code  is  then 
matched  against  stored  canonical  codes.  Since  the 
equivalence  of  canonical  code  is  a sufficient  condition  for 
the  existence  of  an  isomorphism,  we  can  conclude  that  any 
two  graphs  are  isomorphic  if  the  two  graphs  have  identical 
canonical  code  and  the  one-to-one  vertex  correspondence 
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between  the  vertex  sets  of  these  two  graphs  can  be  obtained 
by  pairing  the  final  canonical  orderings  of  the  two  graphs. 
To  illustrate  this,  let  us  consider  the  graph  in  Figure  4.2 
which  is  isomorphic  to  the  graph  in  Figure  4.1.  The 
modified  adjacency  matrix  of  the  graph  is  shown  in  Table 
4.7.  The  matrix  minimization  process  is  shown  in  Tables  4.8 
and  4.9.  Shown  in  Table  4.9  is  the  canonical  matrix  of  the 
graph.  The  canonical  ordering  of  the  vertices  is  the  final 
vertex  ordering  in  the  canonical  matrix,  which  is  c,d,a,b,e. 
The  canonical  code  of  the  graph  is  then  obtained  from  its 
canonical  matrix,  which  is  (10200101,11100011,101210011, 
102101101,1002211110).  Notice  that  the  canonical  codes  of 
the  graphs  in  Figure  4.1  and  Figure  4.2  are  identical. 
Therefore,  we  can  conclude  that  these  two  graphs  are 
isomorphic  and  the  vertex-to-vertex  correspondence  between 
these  two  graphs  are  obtained  by  pairing  the  vertices 
according  to  the  canonical  orderings  of  the  two  graphs  which 
is 


B,  E, A, C, D 
c,d,a,b,e 

It  should  be  pointed  out  at  this  stage  that  the 
equivalence  of  canonical  code  is  only  a sufficient  condition 
for  the  existence  of  an  isomorphism.  Therefore  if  two 
graphs  do  not  have  identical  canonical  codes,  it  doesn't 
mean  that  these  two  graphs  are  not  isomorphic.  For 
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a 


Figure  4.2  Graph  4.2. 


Table  4.7  Modified  Adjacency  Matrix  for  Graph  4.2. 


Li 

1 


L2 

2 


L3 

3 


ll 

4 


l2 

5 


abode 
6 7 8 9 10 


Ll  1 


a 6 


b 7 
c 8 
d 9 
e 10 


0 0 
0 0 
0 0 
0 0 
0 0 
0 1 
0 1 
0 0 
0 0 
1 0 


0 0 
0 0 
0 0 
0 0 
0 0 
0 1 
0 2 
1 0 
1 1 
0 2 


0 0 
0 0 
0 0 
0 0 
0 0 
2 0 
1 1 
2 1 
1 0 
2 1 


0 0 
0 0 
0 0 
0 0 
0 0 
1 1 
0 0 
0 0 
1 0 
1 1 


0 0 
0 0 
0 0 
0 0 
0 0 
0 1 
1 1 
0 1 
0 1 
1 0 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 
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Table  4.8 


1 2 


3 4 


5 6 7 8 9 10 


0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 10  1 
0 0 11 
0 0 10 
0 10  2 
10  0 2 


0 0 
0 0 
0 0 
0 0 
0 0 
2 0 
1 0 
2 1 
1 1 
2 1 


0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 111 
0 0 11 
0 0 0 1 
10  0 1 
1110 


Table  4.9 


1 2 3 4 5 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 10  2 

0 0 111 

0 10  12 

0 10  2 1 

1 0 0 2 2 


6 7 8 9 10 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 10  1 
0 0 0 1 1 

10  0 11 

0 110  1 

11110 
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instance,  consider  a pair  of  isomorphic  graphs  as 
illustrated  in  Figure  4.3  and  Figure  4.4  with  the  modified 
adjacency  matrices  shown  in  Table  4.10  and  4.11 
respectively.  Shown  in  Table  4.12  and  Table  4.13  are  the 
canonical  matrices  for  these  two  graphs.  The  canonical 
codes  of  these  two  graphs  are  (10200011,10200110,10201100, 
101211001,101210110)  and  (10200011,10200101,10201010, 
101210101,101210011).  Notice  that  the  canonical  codes  of 
these  two  graphs  are  not  identical,  however,  they  are  still 
isomorphic.  To  verify  if  two  given  graphs  are  not 
isomorphic,  we  have  to  check  their  canonical  strings  which 
will  be  discussed  in  next  section. 

4.5  Canonical  String 

The  canonical  string  is  obtained  by  scanning  and 
compressing  the  canonical  matrix  to  turn  the  matrix  into  an 
ordered  string.  This  is  done  by  assigning  each  vertex  a 
token  and  then  doing  a raster  scan  on  the  canonical  matrix. 
The  label  vertices  are  given  distinct  identifying  label 
token,  while  the  other  vertices  are  given  a common 
unspecified  token. 

For  the  example  of  Figure  4 . 3 given  previously  with  the 
canonical  matrix  shown  in  Table  4.12,  the  generation  of  the 
canonical  string  is  as  follows:  vertices  1-4  are  given 
tokens  L^,  L2,  l^,  and  I2,  respectively.  All  other  vertices 
are  assigned  the  unspecified  token  X.  The  canonical  matrix 
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A 


Vertex 

Vertex  Label 

A 

Ll 

B 

L2 

C 

L2 

D 

Ll 

E 

L2 

Figure  4 . 3 Graph  4.3. 


Table  4.10  Modified  Adjacency  Matrix  for  Graph  4.3. 


^1  ^2  ^1  ^2  ^ 

1 2 3 4 5 


B C 

6 7 


D E 

8 9 


Li  1 

L2  2 

11  3 

12  4 
A 5 
B 6 
C 7 
D 8 
E 9 


0 0 
0 0 
0 0 
0 0 
1 0 
0 1 
0 1 
1 0 
0 1 


0 0 
0 0 
0 0 
0 0 
1 2 
0 2 
0 2 
1 2 
0 2 


0 0 
0 0 
0 0 
0 0 
0 1 
1 0 
1 0 
1 1 
0 0 


0 0 
0 0 
0 0 
0 0 
1 1 
0 1 
0 0 
0 0 
1 0 


0 

0 

0 

0 

0 

0 

1 

1 

1 
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a 


Figure  4.4  Graph  4.4. 


Table  4.11  Modified  Adjacency  Matrix  for  Graph  4.4. 


Li  L2  li  I2 
12  3 4 


a 

5 


b 

6 


c 

7 


d 

8 


e 

9 


a 


b 

c 

d 

e 


0 

0 

0 

0 

0 

1 

1 

1 

0 


0 

0 

0 

0 

1 

0 

0 

0 

1 


0 

0 

0 

0 

2 

2 

2 

2 

2 


0 

0 

0 

0 

0 

1 

1 

0 

1 


0 

0 

0 

0 

1 

0 

0 

1 

0 


0 

0 

0 

0 

1 

0 

0 

0 

1 


0 

0 

0 

0 

0 

1 

0 

0 

0 


0 

0 

0 

0 

1 

0 

1 

1 

1 
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Table  4.12  Canonical  Matrix  for  Graph  4.3 


1 2 


3 4 5 6 7 8 9 


Li  1 

L2  2 

11  3 

12  4 
B 5 
C 6 
E 7 
A 8 
D 9 


0 

0 

0 

0 

0 

0 

0 

1 

1 


0 

0 

0 

0 

1 

1 

1 

0 

0 


0 

0 

0 

0 

0 

0 

0 

1 

1 


0 

0 

0 

0 

2 

2 

2 

2 

2 


0 

0 

0 

0 

0 

0 

1 

1 

0 


0 

0 

0 

0 

0 

0 

0 

1 

1 


0 

0 

0 

0 

0 

1 

1 

0 

1 


0 

0 

0 

0 

1 

1 

0 

0 

1 


0 

0 

0 

0 

1 

0 

0 

1 

0 


Table  4.13  Canonical  Matrix  for  Graph  4.4 


1 2 


3 4 5 6 7 8 9 


c 


d 

b 

a 

e 


0 

0 

0 

0 

1 

1 

1 

0 

0 


0 

0 

0 

0 

0 

0 

0 

1 

1 


0 

0 

0 

0 

2 

2 

2 

2 

2 


0 

0 

0 

0 

0 

0 

0 

1 

1 


0 

0 

0 

0 

0 

0 

1 

0 

0 


0 

0 

0 

0 

0 

1 

0 

1 

0 


0 

0 

0 

0 

1 

0 

1 

0 

1 


0 

0 

0 

0 

1 

1 

0 

1 

1 
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is  scanned  starting  from  row  5.  For  this  example  the 
resulting  string  will  be 

IL22I22XIL22I22XIL22I22XIL1II12I23XIL1II12I23X 
The  equivalence  of  the  canonical  string  is  a necessary 
condition  for  the  existence  of  an  isomorphism.  Therefore  if 
two  graphs  are  to  be  isomorphic,  they  must  have  identical 
canonical  strings.  To  illustrate  this,  let  us  consider  the 
graph  in  Figure  4 . 4 which  is  isomorphic  to  the  graph  in 
Figure  4.3.  If  the  same  analysis  for  generating  the 
canonical  string  is  done  for  the  graph,  the  generated 
canonical  string  is 

IL22I22XIL22I22XIL22I22XIL1II12I23XIL1II12I23X 
Notice  that  the  canonical  string  of  these  two  graphs  are 


identical . 


CHAPTER  V 

CONNECTIVITY  ANALYSIS  AND  AMBIGUITY  RESOLUTION 

Since  the  connectivity  structure  contains  the  overall 
information  of  the  graph,  any  graph  or  vertex  can  be 
uniquely  specified  by  its  corresponding  connectivity 
structure.  To  uniquely  represent  a graph,  we  have  to  fully 
identify  the  vertex.  To  fully  identify  a vertex  we  might 
start  by  describing  its  immediate  neighboring  region 
surrounding  it.  However,  this  would  not  necessarily  specify 
it  fully  and  uniquely;  we  would  want  to  include  the  next 
regions  further  away,  and  then  beyond  those  until  finally  to 
fully  describe  this  single  vertex,  we  would  end  up  by 
describing  the  entire  graph  from  the  point  of  view  of  that 
vertex.  It  is  generally  quite  difficult  and  tedious  to 
assert  that  two  graphs  have  an  identical  connectivity.  To 
characterize  the  connectivity  structure  of  a graph,  what 
seems  to  be  needed  is  a way  of  evaluating  the  relationship 
of  a vertex  with  all  others  vertices,  or  ideally,  some  way 
to  evaluate  its  relationship  with  the  rest  of  the  graph. 
With  this  in  mind,  the  connectivity  invariants  indicating 
the  connectivity  structure  of  a graph  are  presented  in  this 
chapter  which  can  be  effectively  used  either  as  tools  in  the 
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isomorphism  test  or  as  prime  criteria  for  the  vertex 
partitioning  to  resolve  the  ambiguity. 

The  ambiguity  occurs  if  any  vertex  in  one  graph  has  more 
than  correspondents  in  the  other  graph.  In  addition  to  the 
usage  in  the  isomorphism  test,  the  information  contained  in 
the  distance  matrix  can  be  used  as  the  last-resort  method  to 
resolve  the  ambiguity. 

5.1  Distance  Matrix  and  Connectivity  Matrix 

In  this  section,  we  describe  the  algorithms  for 
generating  the  distance  matrix  and  connectivity  matrix  of  a 
graph.  The  distance  matrix  and  connectivity  matrix  are 
derived  based  on  the  connectivity  structure  of  the  graph 
and  can  be  effectively  used  either  as  tools  in  the 
isomorphism  test  or  as  prime  criteria  for  the  vertex 
partitioning.  The  characterization  of  a graph  using 
distance  matrix  and  connectivity  matrix  for  describing  the 
connectivity  relationships  between  vertices  will  provide  a 
very  high  level  of  differentiating  power. 

5.1.1  Distance  Matrix  and  Distance  List 

A distance  matrix  is  a characterization  of  a graph  which 
offers  information  on  the  relationships  between  all  vertices 
in  the  graph.  If  graph  G is  a graph  with  the  vertex-set 
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(vi,V2 , • • • ,Vn)  , then  the  distance  matrix  D is  the  n x n 
matrix  D(G)  = (d^j),  where 

dij  = min  p(vi,Vj)  (5-1) 

dj^j  indicates  the  length  of  the  shortest  path  from  vertex  i 
to  vertex  j or  the  distance  between  vertex  i and  vertex  j . 

In  this  dissertation,  all  graphs  are  assumed  to  be 
connected,  so  that  every  vertex  is  reachable  from  every 
other  vertex.  Thus,  every  element  except  the  diagonal 
element  in  the  distance  matrix  is  nonzero. 

A distance  matrix,  D,  can  be  thought  of  as  a matrix 
consisting  of  n distance  lists  (Dj^, D2 , • • • , Dj^)  , where  n is 
the  number  of  vertices  in  the  graph,  each  row  can  be  thought 
of  as  a distance  list,  one  for  each  vertex. 


Dl 

D2 


A distance  list  describes  in  linear  notation  the 
connectivity  structure  about  an  individual  vertex  through 
several  levels  of  connectivity.  It  is  a unique  expression 
of  the  connectivity  structure  within  the  neighborhood  of  the 
subject  vertex  and  is  an  attribute  of  the  vertex. 
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Generation  of  the  distance  list  for  a given  vertex  can 
be  readily  accomplished  by  direct  use  of  the  adjacency 
matrix,  which  can  be  stated  as  follows: 

Algorithm  5-1 

Step  1:  Choose  the  subject  vertex  as  the  root  node  to 
generate  a tree. 

Step  2:  Expand  the  root  node  using  the  connectivity 

information  in  the  adjacency  matrix.  The  nodes  at  the  first 
level  are  the  immediate  neighbors  of  the  root  node. 

Step  3:  Check  the  expanded  nodes  at  the  current  level  one  by 
one,  according  to  the  order  from  left  to  right,  to  see  if  it 
has  been  expanded.  If  so,  inactivate  the  node. 

Step  4.  Check  the  status  of  the  expanded  nodes  at  the 
current  level  and  record  those  active  nodes.  If  every 
vertex  has  been  expanded,  stop.  Otherwise,  go  to  step  5. 
Step  5:  Expand  those  active  nodes  to  the  next  level  then  go 
to  step  3 . 

The  tree  generated  relative  to  the  subject  vertex  v^  is 
called  the  distance  tree,  rooted  at  Vj[,  such  that  the  unigue 
tree  path  from  Vj^  to  each  tree  node  Vj  ^ Vj^  is  in  fact  a 
shortest  path  from  v^  to  Vj . 

The  distance  list  for  the  subject  vertex,  vertex  i,  is 
obtained  by  assigning  value  'j'  to  those  active  nodes  at  the 
jth  level  of  the  distance  tree. 
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Di  = Sj  j-D(j)i  (1  < j < ki) 

= 0(l)i  + 2*d(2)^  + ...  + (5-3) 

where  D(3)j^  denotes  the  j"^^-level  distance  list  for  the  set 
of  active  nodes  at  the  j^^-level  of  the  distance  tree,  1 < j 
< kj^.  kj^  is  the  last  level  in  the  distance  tree  of  vertex  i 
where  all  vertices  have  been  expanded. 

To  illustrate  the  generation  of  distance  list,  let  us 
consider  the  graph  in  Figure  5.1  with  the  adjacency  matrix 
shown  in  Table  5.1.  The  distance  tree  generated  by 
Algorithm  5-1  for  vertex  1 is  illustrated  in  Figure  5.2.  In 
Figure  5.2,  the  nodes  in  circles  represent  the  active  nodes 
at  each  level  of  the  distance  tree.  There  are  two  active 
nodes  V2 , V3  at  the  first  level  of  the  distance  tree,  two 
active  nodes  V4 , V5  at  the  second  level  of  the  distance 
tree,  and  three  active  nodes  V5,  V7,  V3  at  the  third  level 
of  the  distance  tree.  The  j^^-level  distance  list  of  vertex 
1 can  be  obtained  by  assigning  value  '1'  to  those  active 
nodes  at  the  level  of  the  distance  tree.  The  results 

generated  are  0(^)3  = (01100000),  0(^)3^  = (OOOlOlOO)  , and 

d(3)^  = (00001011).  The  distance  list  of  vertex  1 is  then 

generated  according  to  Equation  5-3. 

Di  = 1* (01100000)  + 2* (00010100)  + 3* (00001011) 

= (01123233) 
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Figure  5.1  Graph  5.1. 


Table  5.1 


Adjacency  Matrix  for  Graph  5.1. 


Vi  V2  V3  V4  V5  Vg  Vy  Vg 


vi 

^2 

V3 

V4 

V5 

V6 

V7 

Vs 


0 

1 

1 

0 

0 

0 

0 

0 


1 

0 

1 

1 

0 

1 

0 

0 


1 

1 

0 

1 

0 

0 

0 

0 


0 

1 

1 

0 

1 

0 

0 

0 


0 

0 

0 

1 

0 

0 

0 

1 


0 

1 

0 

0 

0 

0 

1 

1 


0 

0 

0 

0 

0 

1 

0 

1 


0 

0 

0 

0 

1 

1 

1 

0 
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1 


3 0 0 0 


Figure  5.2  Distance  Tree  for  Vertex  1 in  Graph  5.1. 
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The  distance  matrix  of  the  graph  is  then  obtained  by 
combining  the  distance  lists  of  the  graph,  one  for  each 
vertex.  For  instance,  consider  the  graph  as  shown  in 

Figure  5.1.  Applying  Algorithm  5-1  to  each  vertex  of  the 
graph,  the  distance  matrix  of  the  graph  is  obtained  and 
shown  in  Table  5.2,  where  = (01123233),  D2  = (10112122), 
D3  = (11012233),  D4  = (21101232),  D5  = (32210221),  D5  = 

(21222011),  D7  = (32332101),  and  Dg  = (32321110). 

5. 1.1.1  Neighbor 

The  determination  of  the  neighbors  of  a vertex  is  easily 
carried  out  on  the  basis  of  the  distance  matrix  D.  For  any 
vertex,  vertex  i,  the  m^^-level  neighbors  of  vertex  i are 
the  set  of  vertices  (Vj)  if  vertex  j is  reachable  from 
vertex  i of  distance  m.  In  terms  of  distance  matrix,  the 
mth_ievel  neighbors  of  vertex  i are  the  set  of  vertices 
whose  corresponding  elements  in  the  distance  matrix  have  the 
value  m,  i.e.,  dj^j  = m.  The  relation  "vj^  is  a m^^-level 
neighbor  of  Vj”  is  denoted  by  v^  v j . 

Vi Vj  if  dij  = m (5-4) 

For  example,  referring  to  the  graph  in  Figure  5.1  with 
the  distance  matrix  shown  in  Table  5.2,  the  first-level 
neighbors  of  vertex  1 are  {V2,V3),  the  second-level 
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Table  5.2 


Distance  Matrix  for  Graph  5.1. 


Vi  V2  V3  V4  V5  Vg  V7  Vq 


vi 

V2 

V3 

V4 

V5 

V6 

V? 

Vs 


0 112 
10  11 
110  1 
2 110 
3 2 2 1 

2 12  2 
3 2 3 3 

3 2 3 2 


3 

2 

2 

1 

0 

2 

2 

1 


2 

1 

2 

2 

2 

0 

1 

1 


3 3 
2 2 
3 3 
3 2 
2 1 
1 1 
0 1 
1 0 


Table  5.3  Connectivity  Matrix  for  Graph  5.1. 


vi 

V2 

V3 

V4 

V5 

V6 

V7 

Vs 


12  3 

2 2 3 

4 3 0 

3 2 2 

3 3 1 

2 4 1 

3 4 0 

2 2 3 

3 2 2 
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neighbors  are  and  the  third-level  neighbors  are 

{V5,V7,Vg) . 

5. 1.1. 2 Maximum  distance 

Let  us  define  the  maximum  distance  of  a graph  to  be  the 
longest  distance  between  any  two  vertices  in  the  graph.  In 
terms  of  the  distance  matrix,  the  maximum  distance  k is 
egual  to  the  largest  element  d^j  in  the  matrix. 

k = maxi^j  dij  d^j  e D and  1 < i,j  < n (5-5) 

The  determination  of  the  maximum  distance  of  a graph  is 
easily  carried  out  on  the  basis  of  the  distance  matrix  D. 
For  example,  referring  to  the  graph  in  Figure  5.1  with  the 
distance  matrix  shown  in  Table  5.2.  Since  the  largest 
element  in  the  distance  matrix  of  the  graph  is  3 , the 
maximum  distance  of  the  graph  is  3 . 

5.1.2  Connectivity  Matrix.  Connectivity  Vector,  and 
and  Connectivity  Code 

The  connectivity  matrix  is  a unique  representation  of  a 
graph  and  it  attempts  to  characterize  each  vertex  of  the 
graph  by  its  relationship  with  ever  increasing  numbers  of 
vertices  which  are  more  distantly  connected.  The 
connectivity  matrix  of  a graph  is  obtained  by  direct  use  of 
the  distance  matrix.  Two  invariants,  the  connectivity 
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vector  and  the  connectivity  code,  are  then  derived  based  on 
the  connectivity  matrix.  The  connectivity  vector  can  be 
used  as  a vertex  invariant  in  the  vertex  partitioning.  The 
connectivity  code  can  be  used  as  a graph  invariant  in  the 
isomorphism  test. 

5. 1.2.1  Connectivity  matrix 

Assume  k is  the  maximum  distance  of  graph  G.  Define  a 
connectivity  matrix  C to  be  an  n x k matrix  C(G)  = (c^j), 
such  that  the  element  c^j  is  the  number  of  vertices  which 
are  a distance  j away  from  vertex  i.  In  terms  of  distance 
matrix,  cj^j  is  the  number  of  j's  in  the  i^^  row  of  the 
distance  matrix.  Once  the  distance  matrix  has  been 
computed,  the  derivation  of  the  connectivity  matrix  is 
simple.  For  instance,  consider  the  graph  in  Figure  5.1 
with  the  distance  matrix  shown  in  Table  5.2.  Shown  in 
Table  5.3  is  the  connectivity  matrix  of  the  graph.  In  the 
connectivity  matrix  of  the  graph  the  first  row  is  (223)  , the 
first  element  giving  the  number  of  vertices  whose  distance 
from  vertex  1 is  1,  the  second  element  giving  the  number  of 
vertices  whose  distance  from  vertex  1 is  2,  and  the  third 
element  giving  the  number  of  vertices  whose  distance  from 


vertex  1 is  3 . 


150 


5. 1.2. 2 Connectivity  vector 

Each  row  of  the  connectivity  matrix  can  be  viewed  as  a 
connectivity  vector,  one  for  each  vertex. 


Cl 

C2 


The  connectivity  vector  of  vertex  i is  the  i^^  row  in 
the  connectivity  matrix.  The  connectivity  vector  of  a 
vertex  Vj^  is  the  ordered  set  (cii,CjL2/ * * * where  each 
c^j  is  equal  to  the  number  of  vertices  and  having  distance  j 
from  Vj^. 

Vertices  which  exhibit  identical  connectivity  vector 
will  be  assigned  to  the  same  cell.  v^j^  will  map  to  v^^  in 
an  isomorphism  only  if  = c^j^,  for  all  m.  Therefore, 
the  connectivity  vector  can  be  used  as  a vertex  invariant  in 
the  vertex  partitioning.  To  illustrate  the  partitioning 
result  generated  by  the  connectivity  vector,  let  us  consider 
the  graph  in  Figure  5.3  which  is  isomorphic  to  the  graph  in 
Figure  5.1.  The  adjacency  matrix  of  the  graph  is  shown  in 
Table  5.4.  Shown  in  Table  5.5  is  the  distance  matrix  of  the 
graph,  which  is  obtained  by  applying  Algorithm  5-1  to  each 
vertex  of  the  graph.  Once  the  distance  matrix  of  the  graph 
has  been  obtained.  The  connectivity  matrix  of  the  graph  can 
be  easily  derived,  the  derived  connectivity  matrix  is  shown 


151 


V 


a 


Figure  5.3 


Graph  5.3. 


Table  5.4 


Adjacency  Matrix  for  Graph  5.3. 


Va  Vb  Vc  Vd  Vg  Vf  vg  Vh 


''a 

Vb 

Vc 

Vd 

Ve 

Vf 

Vh 


0 

1 

1 

0 

0 

0 

0 

1 


1 

0 

1 

0 

0 

0 

0 

0 


10  0 
10  0 
0 0 1 
0 0 1 
110 
0 0 0 
0 10 
0 10 


0 0 1 

0 0 0 

0 0 0 

oil 

0 0 0 

oil 
10  1 
10  1 
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Table 


Table  5 


5.5  Distance  Matrix  for  Graph  5.3. 


Va  Vb  Vc  Vd  Ve  Vf  Vg 


''a 

Vb 

Vc 

Vd 

Ve 

Vf 


0 

1 

1 

2 

2 

2 

2 

1 


1 

0 

1 

3 

2 

3 

3 

2 


1 

1 

0 

2 

1 

3 

3 

2 


2 2 
3 2 

2 1 
0 1 
1 0 
2 3 

1 2 
1 2 


2 

3 

3 

2 

3 

0 

1 

1 


2 1 
3 2 

3 2 

1 1 
2 2 
1 1 
0 1 
1 0 


6 Connectivity  Matrix  for  Graph  5.3. 


12  3 


''a 

Vb 

Vc 

Vd 

Ve 

Vf 

Vrf 


3 

2 

3 

3 

2 

2 

3 

4 


4 0 

2 3 
2 2 

3 1 

4 1 

2 3 
2 2 

3 0 
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in  Table  5.6.  The  connectivity  vectors  for  the  vertices  in 
the  graphs  of  Figure  5.1  and  Figure  5.3  are  extracted  from 
their  corresponding  connectivity  matrices,  the  results  are 
shown  in  Table  5.7  and  Table  5.8  respectively. 


Table  5.7  A Summary  of  Connectivity  Vector  for  Graph  5.1. 


Vertex 

Connectivity  Vector 

1 

223 

2 

430 

3 

322 

4 

331 

5 

241 

6 

340 

7 

223 

8 

322 

Table  5.8  A Summary  of  Connectivity  Vector  for  Graph  5.3. 


Vertex 

Connectivity  Vector 

a 

340 

b 

223 

c 

322 

d 

331 

e 

241 

f 

223 

g 

322 

h 

430 

Partition  the  vertex  sets  of  the  two  graphs  by  applying 
the  connectivity  vector  as  the  vertex  invariant,  the 
generated  partitioning  result  is 


154 


(1,7)  ,5,  (3,8)  ,4,6,2 
(b,f) ,e, (c,g) ,d,a,h 


5. 1.2. 3 Connectivity  code 

A sequence  (C^, C2 , • * • , C^)  can  be  extracted  from  the 
connectivity  matrix  by  considering  each  row  of  the  matrix  as 
a single  decimal  number.  Such  numbers  can  be  ordered 
according  to  their  magnitude,  starting  with  the  smallest 
decimal  number.  Each  number  should  be  examined  and  numbers 
interchanged  until  the  resulting  set  of  numbers  (C*j^) 
combine  to  give  the  smallest  value.  Each  connectivity 
matrix  is  then  characterized  with  a sequence  of  decimal 
numbers  (C*3^,C*2,  • • • ,C*j^)  . To  have  the  smallest  value, 
obviously  has  to  be  as  small  as  possible,  because  this 
is  the  leading  part  of  the  sequence.  This  requires  to  be 
the  smallest  number  possible,  C* 2 to  be  equal  to  or  to 
be  the  next  smallest  number  possible,  etc.  That  is  to  say 
numbers  comprising  the  sequence  (C*^)  should  be  ordered 
thus: 

C*i  < C*2  < •••  < C*n  (5-7) 

The  minimum  number  sequence  generated  by  this  procedure  is 
called  the  connectivity  code  of  the  graph.  For  example, 
consider  the  graph  in  Figure  5.1  with  the  connectivity 
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matrix  shown  in  Table  5.3.  The  connectivity  code  of  the 
graph  is  (223,223,241,322,322,331,340,430),  which  is 
generated  by  minimizing  the  number  sequence  of  the 
connectivity  matrix. 

The  problem  of  establishing  the  identity  of  graphs 
associated  with  graphs  and  is  now  converted  into  the 
problem  of  finding  out  the  identity  of  their  associated 
connectivity  codes.  If  two  graphs  are  to  be  isomorphic, 
they  must  have  the  same  connectivity  code.  Therefore  the 
connectivity  code  can  be  used  as  a graph  invariant  in  the 
isomorphism  test  to  eliminate  those  non-isomorphic  graphs. 
To  illustrate  this,  let  us  consider  the  graph  in  Figure 
5.3,  which  is  isomorphic  to  the  graph  in  Figure  5.1.  The 
adjacency  matrix  of  the  graph  is  shown  in  Table  5.4.  Shown 
in  Table  5.5  is  the  distance  matrix  of  the  graph,  which  is 

obtained  by  applying  Algorithm  5-1  to  each  vertex  of  the 

graph.  Once  the  distance  matrix  of  the  graph  has  been 
obtained.  The  connectivity  matrix  of  the  graph  can  be 
easily  derived,  the  derived  connectivity  matrix  is  shown  in 
Table  5.6.  The  connectivity  code  of  the  graph  is  then 

obtained  from  its  connectivity  matrix,  which  is 
(223,223,241,322,322,331,340,430).  Notice  that  the 

connectivity  codes  of  the  two  graphs  in  Figure  5.1  and 

Figure  5.3  are  identical,  since  these  two  graphs  are 
isomorphic. 
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5.2  Ambiguity  Resolution  Using  Relaxation  Partitioning 

While  the  partitioning  result  generated  by  the  vertex 
invariants  discussed  so  far  usually  reduces  the  size  of 
possible  vertex  mappings,  however,  it  may  not  uniquely 
characterize  the  vertices  of  the  graph.  The  ambiguity 
occurs  when  all  vertex  invariants  have  been  applied  in  the 
vertex  partitioning  and  the  one-to-one  vertex  correspondence 
still  cannot  be  found.  For  instance,  consider  the  graphs 
illustrated  in  Figure  5.4,  the  vertex  partitioning  process 
by  applying  the  edge  adjacency  vector  is  shown  in  Table  5.7. 

Table  5.7 


Vertex 

Edge  Adjacency  Vector 
for 

Edge  Adjacency  Vector 
for  g2 

1 

(1,1,0) 

(1,1,0) 

2 

(1,1,0) 

(0,1,3) 

3 

(1,1,1) 

(0,1,1) 

4 

(0,1,3) 

(1,1,0) 

5 

(0,1,1) 

(1,1,1) 

6 

(1,1,1) 

(1,1,1) 

^ (v^i,v^4) , (v^5,v^6) 


Since  the  generated  partitioning  is  not  an  isomorphic 
partitioning,  therefore  we  apply  the  connectivity  vector  to 
refine  the  partitioning.  The  vertex  partitioning  process  by 
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G‘ 


G" 


Figure  5.4  A Pair  of  Graphs  With  Ambiguity. 
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applying  the  connectivity  vector  to  the  vertex  cell  sets 
generated  by  the  edge  adjacency  vector  is  shown  in  Table 

5.8. 


Table  5.8 


Vertex 

Connectivity  Vector 
for  G^ 

Connectivity  Vector 
for  G^ 

1 

(2,2,1) 

(2,2,1) 

2 

(2,3,0) 

(4,1,0) 

3 

(3,2,0) 

(2,2,1) 

4 

(4,1,0) 

(2,3,0) 

5 

(2,2,1) 

(3,2,0) 

6 

(3,2,0) 

(3,2,0) 

v^3,v^2'V^1,v^4, (v^5,v^6) 

Since  all  vertex  invariants  have  been  applied,  it 
requires  that  if  and  are  isomorphic,  then  v^5  will  be 
mapped  to  v^3,  v^4  will  be  mapped  to  v^2/  ^^1  will  be  mapped 
to  v^]^,  and  v^2  will  be  mapped  to  v^4.  But  this  leaves  some 
uncertainty  about  the  mapping  for  vertices  v^3,  v^5,  v^5, 
and  v^5.  Even  though  the  partitioning  using  the 
connectivity  vector  refine  the  vertex  cell  sets,  it  still 
does  not  uniquely  characterize  the  vertices.  Therefore  the 
ambiguity  occurs  because  a unique  one-to-one  vertex 
correspondence  cannot  be  generated  at  this  stage. 
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5.2.1  Relaxation  Partitioning  Algorithm 

The  ambiguity  arises  from  a deficient  vertex  invariant 
class,  however,  it  can  be  avoided  if  it  is  possible  to 
discover  a property  powerful  enough  to  resolve  the  existing 
ambiguity.  A refined  procedure  which  selects  possible 
vertex  mappings  between  the  two  graphs  and  checks  these 
mappings  for  consistency  is  used  to  resolve  the  ambiguity. 
In  order  to  have  an  intuitive  justification  of  the  refined 
procedure,  let  = (V^kl^ V^k2 / * * * / ^^kp>  ^^k  = 

V^j^2  / • • • / V^kq}  equivalent  cells 

generated  by  all  vertex  invariants  on  sets  of  vertices 
and  of  and  respectively.  Consider  two  vertices 
v^kis  "'^^kit  belonging  to  and  respectively.  An 

isomorphism  of  G^  onto  G^  can  map  onto  only  if, 

for  each  vertex  having  distance  h from  there 

exists  one  vertex  v^g,  such  that  its  distance  from  is 

h and  if  v^f  belongs  to  V^]^j , v^g  belongs  to  the 
corresponding  cell  V^j^j  . 

The  implementation  of  this  refined  procedure  is  termed 
the  relaxation  partitioning.  Assume  we  have  an  ordered 
collection  of  cells  of  V,  denoted  by  V^,  V2 , 
depending  on  some  invariant  or  invariants  (such  as  the 
degree  partitioning,  the  connectivity  vector  partitioning, 
and  so  on).  To  each  vertex  x assign  a list  (ai,a2,  • • • ,aj^) 
where  a]^  (1  < k < i)  is  the  number  of  vertices  in  cell  V]^ 
that  are  adjacent  to  x with  distance  m (Initially,  m = 1)  . 
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Using  these  lists,  each  cell  may  now  be  partitioned  into 
subcells,  each  subcell  consisting  of  all  vertices  with  a 
given  list.  These  subcells  can  be  ordered  according  to  the 
lexical  ordering  of  the  corresponding  lists,  and  in  this  way 
we  may  obtain  a refinement  of  the  original  collection  of 
cells.  No  refinement  will  be  obtained  if  all  vertices  in 
each  subcell  have  identical  lists.  If  a refinement  has  been 
obtained  then  the  procedure  is  reapplied  until  the  number  of 
cells  equals  the  number  of  vertices  of  the  graph. 

The  proposed  relaxation  partitioning  algorithm  is  given 
below: 

Algorithm  5-2 

Step  1:  Assume  that  the  vertex  set  has  been  partitioned  into 
i cells  (i  > 1).  The  cells  are  ordered  according  to  their 
order  in  the  vertex  cell  set.  Let  denote  the  set  of 
vertices  in  the  cell.  To  each  vertex  y e V associate  a 

list  (a^ , a2  , • * * , aj^)  where  aj  equals  the  number  of  vertices, 

V e such  that  v y (1  < m)  . Initially  m = 1.  Note 

that  Sjaj  = d(y) , the  degree  of  vertex  y. 

Step  2:  Assume  that  V has  been  partitioned  into  i cells  (i  > 
1)  . Let  v3  denote  the  set  of  vertices  in  the  cell.  To 

each  vertex  y e V associate  a list  (a^^ , a2 , • • • , a j^)  where  aj 
equals  the  number  of  vertices,  v e v3  such  that  (y,v)  e E 
(1  < j < i) . Note  that  Sjaj  = d(y),  the  degree  of  vertex  y. 
Step  3:  We  now  define  a refinement  of  the  cell.  First 

perform  an  ordering  of  the  vertices  in  the  cell  by  examining 
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the  lists  associated  with  the  vertices  of  . Order  the 
vertices  by  lexicographically  ordering  their  lists.  Assume 
vertex  y has  list  (a]^,a2,  • • • and  vertex  z has  list 
(bi,b2 , • • • ,bi)  where  y,z  e . Suppose  aj^  = bj^,  Vh  < k;  if 
aR+i  < then  vertex  y precedes  vertex  z.  If  vertices  y 
and  z have  identical  lists,  then  the  precedence  of  the  two 
vertices  are  arbitrary. 

(1)  If  all  the  vertices  have  the  same  list,  no 
refinement  is  done. 

(2)  If  the  lists  are  not  identical,  use  the  ordering  of 
the  vertices  of  v3  to  refine  as  follows:  Assume  that 
vertex  y precedes  all  other  vertices  of  , then  the  first 
subcell  of  consists  of  vertex  y and  all  other  vertices  of 

which  have  the  same  list  as  y.  Remove  those  vertices 
from  , and  from  the  remaining  vertices  choose  the  vertex 
that  precedes  all  other  remaining  vertices.  This  vertex  and 
all  other  vertices  with  the  same  list  form  the  second 
subcell  of  . Continue  this  process  until  all  vertices  of 

belong  to  subcell  of  . 

Step  4:  Apply  step  3 to  all  i cells.  If  at  least  one  of  the 
cells  is  refined,  then  we  reindex  all  the  cells  and  go  to 
step  2.  This  reindexing  is  carried  out  as  follows:  Assume 
cells  1 to  j-1  are  not  refined,  but  cell  j is;  cell  1 to  j-1 
retain  their  cell  indices.  Index  the  k subcells  of  cell  j 
as  j,  j+1,  ••*,  j+k-1  according  to  the  ordering  of  subcells 
defined  in  step  3.  The  next  cell  to  be  indexed  (cell  j+1  or 
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a subcell  of  it)  is  assigned  the  index  j+k,  and  the  process 
continues  until  all  cells  have  been  indexed. 

Step  5:  If  the  number  of  cells  is  equal  to  the  number  of 

vertices  of  the  graph,  the  algorithm  is  finished. 
Otherwise,  increase  m by  1 and  go  to  step  1. 

To  illustrate  the  relaxation  partitioning  algorithm,  let 
us  consider  the  graph  shown  in  Figure  5.5.  Assume  the 
vertex  set  V of  the  graph  has  been  partitioned  into  a cell 
set  with  3 cells  (A,B,F,G),  (D,H) , and  (C,E) . The  ambiguity 

occurs  because  a unique  vertex  correspondence  has  not  been 
generated. 

Initially,  we  have  the  following  partitioning  of  the 
vertex  set; 

Cell  Index  Vertices 

I A,B,F,G 

II  D,H 

III  C,E 

We  now  attempt  to  refine  this  partitioning  by  assigning 
a list  to  each  vertex  such  that  the  k^^  element  is  the 
number  of  vertices  in  cell  k that  are  adjacent  to  the  vertex 
with  distance  1.  From  step  2 the  lists  are 


Cell  Index  Vertex  List 

A (1,1,0) 

B (1,0,1) 

I F (0,1,1) 

G (0,2,0) 

II  D (2,0,1) 

H (2,0,1) 

III  C (2,0,1) 

E (0,2,1) 
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A 


Figure  5.5  An  Example  Illustrating  Relaxation  Partitioning 
Technique . 
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The  lists  are 
order  is 
Cell  Index 

I 

II 

III 


lexicographically  ordered  and  the  new 


Vertex 

F 

G 

B 

A 


List 

(0,1,1) 

(0,2,0) 

(1,0,1) 

(1,1,0) 


D (2,0,1) 

H (2,0,1) 


E (0,2,1) 

C (2,0,1) 


In  step  3,  cell  3 yields  4 subcells  and  cell  3 yields  2 
subcells.  Cell  1 and  cell  3 are  refined.  In  step  4 the  new 
subcells  are  reindexed  as  follows: 


Cell  Index 

I 

II 

III 

IV 

V 

VI 

VII 


Vertices 

F 

G 

B 

A 


D,H 


E 

C 


Since  a refinement  was  performed  step  2 is  re-entered. 


The  new  lists  are  assigned  as  follows: 


Cell  Index 
I 


Vertex 

F 


II  G 

III  B 

IV  A 

V D 

H 


VI  E 

VII  C 


List 

(0,0, 0,0, 1,0,1) 

(0,0, 0,0, 2, 0,0) 

(0,0, 0,1, 0,0,1) 

(0,0, 1,0, 1,0,0) 

(0,1, 0,1, 0,0,1) 
(1,1, 0,0, 0,1,0) 

(0,0, 0,0, 2, 0,1) 

(1,0, 1,0, 0,1,0) 
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In  step  4,  cell  5 yields  2 subcells.  Therefore,  cell  5 
is  refined.  In  step  5,  the  subcells  are  reindexed  as 
follows: 


Cell  Index 

I 

II 

III 

IV 

V 

VI 

VII 

VIII 


Vertices 

F 

G 

B 

A 

D 

H 

E 

C 


Since  the  number  of  cells  is  equal  to  the  number  of 
vertices  of  the  graph,  the  algorithm  is  finished.  Notice 
that  the  ambiguity  presented  in  the  original  cell  set  has 
been  resolved. 


5.2.2  Complexity  Analysis  of  Relaxation  Partitioning 
Algorithm 

Since  the  algorithm  requires  at  least  one  refinement  in 
order  to  continue,  and  the  number  of  cells  is  bounded  by  the 
number  of  vertices  of  the  graph  (which  is  finite) , the 
algorithm  will  always  terminate. 

Assume  the  number  of  vertices  of  the  graph  is  n,  the 
number  of  cells  in  the  original  cell  set  is  m,  and  the 
largest  element  in  the  connectivity  matrix  is  h.  In  the 


worst  case.  Algorithm  5-2  lexicographically  sorts  a length  n 
sequence  of  k-tuples  (m  < k < n) , where  each  component  of  a 
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k-tuple  is  an  integer  between  0 and  h,  in  time  0((h+n)k) 
(Aho,  Hopcroft,  and  Ullman  [1974]). 


5.2.3  Ambiguity  Resolution  Using  Relaxation  Partitioning 


To  illustrate  the  process  of  ambiguity  resolution  using 
relaxation  partitioning,  let  us  consider  graph  in  Figure 
5.4.  The  vertex  cell  set  generated  by  applying  all  vertex 
invariants  to  the  graph  is  {v^5,v^4 , v^l,v^2 ^ ^ • 
From  step  1 the  lists  are 


Cell  Index 
I 


Vertex 

5 


II 


4 


III  1 

IV  2 

3 

V 6 


List 

(0,1, 0,0,1) 

(1,0, 0,1, 2) 

(0,0, 0,1,1) 

(0,1, 1,0,0) 

(0,1, 1,0,1) 
(1,1, 0,0,1) 


In  step  2 cell  4 yields  2 subcells.  Cell  4 is  refined.  In 
step  3 the  subcells  are  reindexed  as  follows: 


Cell  Index 

I 

II 

III 

IV 

V 

VI 


Vertices 

5 
4 
1 
2 
3 

6 


Since  the  cell  number  is  equal  to  the  vertex  number  of  the 
graph,  the  algorithm  is  finished.  The  final  vertex  cell  set 
is  {vl5,vl4,vli,vl2,vl3,vl6} . 
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Applying  Algorithm  5-2  to  graph  in  Figure  5.4,  from 
step  1 the  lists  are 


Cell  Index 
I 


Vertex 

3 


II 


2 


III 


1 


IV 


4 


V 


5 

6 


List 

(0,1, 0,0,1) 

(1,0, 0,1, 2) 

(0,0, 0,1,1) 

(0,1, 1,0,0) 

(0,1, 1,0,1) 
(1,1, 0,0,1) 


In  step  2 cell  4 yields  2 subcells.  Cell  4 is  refined.  In 
step  3 the  subcells  are  reindexed  as  follows: 


Cell  Index 

I 

II 

III 

IV 

V 

VI 


Vertices 

3 
2 
1 

4 

5 

6 


Since  the  cell  number  is  equal  to  the  vertex  number  of  the 
graph,  the  algorithm  is  finished.  The  final  vertex  cell  set 
is  (v23,v22,v23^,v24,v25,v2g)  . 

The  one-to-one  vertex  correspondence  is  then  obtained  by 
pairing  the  vertices  in  the  final  vertex  cell  sets.  For 
instance,  referring  to  the  graphs  in  Figure  5.4,  the  final 
vertex  cell  sets  of  the  two  graphs  are  (vi5,vi4,vi2/V^2' 
vi3,vi5)  and  { v^3  , v^2  , v^^^ , , v^5 , v^5 } respectively.  Thus, 
the  isomorphic  partitioning  of  the  two  graphs  generated  by 
the  relaxation  partitioning  algorithm  is 
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v^5'Vl4,vli,vl2,vl3,vl6 

5.2.4  Comparison  of  Ambiguity  Resolution  Scheme 

In  this  section,  we  will  make  a comparison  between  the 
ambiguity  resolution  scheme  used  in  the  previous  heuristic 
approach  and  our  approach.  To  illustrate  this,  let  us 
consider  the  problem  of  determining  whether  or  not  the  two 
molecules  in  Figure  5.6  are  isomorphic.  The  ambiguity 
exists  in  the  partitioning  result  because  a unique  one-to- 
one  vertex  correspondence  cannot  be  generated  when  all 
vertex  invariants  have  been  applied. 

The  previous  heuristic  approach  used  the  tedious 
connectivity  information  to  resolve  the  ambiguity  (Penny 
[1964],  Figueras  [1972]).  The  process  of  ambiguity 
resolution  using  first-level  connectivity  and  set  reduction 
partitioning  is  outlined  in  Table  5.9.  In  our  approach,  we 
use  the  relaxation  partitioning  technique  to  resolve 
ambiguity.  The  process  of  ambiguity  resolution  using 
relaxation  partitioning  is  outlined  in  Table  5.10.  Notice 
that  our  relaxation  partitioning  technique  needs  only  one 
iteration  to  resolve  the  ambiguity,  while  the  previous 
heuristic  approach  has  to  utilize  the  tedious  connectivity 
information  and  set  reduction  partitioning  to  resolve  the 
ambiguity.  The  advantage  of  our  relaxation  partitioning  is 
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more  prominent  when  first-level  connectivity  cannot  resolve 
the  ambiguity  and  higher-level  connectivity  has  to  be 
utilized. 
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G 


s 


Figure  5.6  Ambiguity  Resolution  Comparison  Between  Previous 
Heuristic  Approach  and  Our  Approach. 
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Table  5.9  Ambiguity  Resolution  Using  Set  Reduction. 


Features 

Corresponding 

Subsets  in 
g2 

Subset 

Number 

Vertex  Label 

C 

1,2, 4, 7 

b, e, f ,h 

1 

0 

3,6,8 

a,c,g 

2 

N 

5 

d 

3 

Degree 

One 

1,3, 5, 6 

a,d,g,h 

4 

Two 

8 

c 

5 

Three 

2,4 

e,f 

6 

Four 

7 

b 

7 

Intersection  of 

Subsets  1 and  4 

1 

h 

8 

Vertices  Adjacent 

to  1/h 

2 

f 

9 

5/d 

4 

e 

10 

7/b 

4,6,8 

a,  c,  e 

11 

Intersection  of 

Subsets  4 and  11 

6 

a 

12 

Remaining  Vertex 

3 

g 

13 

Vertex 

Correspondence 

1 

h 

2 

f 

3 

g 

4 

e 

5 

d 

6 

a 

7 

b 

8 

c 
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Table  5.10  Ambiguity  Resolution  Using  Relaxation  Partitioning. 


• Generated  cell  set  of  G^;  {1, (2,4) ,7,5, (3,6) ,8) 
Generated  cell  set  of  G^ ; (h, (e, f ) ,b,d, (a,g) ,c> 


For  Graph  G^ 

Cell  Index 

Vertex 

List 

I 

1 

(0,1, 0,0, 0,0) 

II 

2 

(1,1, 0,0, 1,0) 

4 

(0,1, 1,1, 0,0) 

III 

7 

(0,1, 0,0, 1,1) 

IV 

5 

(0,1, 0,0, 0,0) 

V 

3 

(0,1, 0,0, 0,0) 

6 

(0,0, 1,0, 0,0) 

VI 

8 

o 

o 

H 

O 

o 

o 

Cell  Index 

Vertex 

List 

I 

1 

(0,1, 0,0, 0,0) 

II 

4 

(0,1, 1,1, 0,0) 

2 

(1,1, 0,0, 1,0) 

III 

7 

(0,1, 0,0, 1,1) 

IV 

5 

(0,1, 0,0, 0,0) 

V 

6 

(0,0, 1,0, 0,0) 

3 

(0,1, 0,0, 0,0) 

VI 

8 

(0,0, 1,0, 0,0) 

Cell  Index 

Vertices 

I 

1 

II 

4 

III 

2 

IV 

7 

V 

5 

VI 

6 

VII 

3 

VIII 

8 
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For  Graph 

Cell  Index 

Vertex 

List 

I 

h 

(0,1, 0,0, 0,0) 

II 

e 

(0,1, 1,1, 0,0) 

f 

(1,1, 0,0, 1,0) 

III 

b 

(0,1, 0,0, 1,1) 

IV 

d 

(0,1, 0,0, 0,0) 

V 

a 

(0,0, 1,0, 0,0) 

g 

(0,1, 0,0, 0,0) 

VI 

c 

(0,0, 1,0, 0,0) 

Cell  Index 

Vertices 

I 

h 

II 

e 

III 

f 

IV 

b 

V 

d 

VI 

a 

VII 

g 

VIII 

c 

Vertex  Correspondence 
G^  g2 

1 

h 

2 

f 

3 

g 

4 

e 

5 

d 

6 

a 

7 

b 

8 

c 

CHAPTER  VI 
APPLICATIONS 

The  label  graph  isomorphism  algorithm  proposed  in  this 
dissertation  is  applicable  to  a large  class  of  label  graphs 
which  can  be  formulated  as  the  label  graph  isomorphism 
problem.  Given  an  object  graph  and  a set  of  model  graphs, 
the  algorithm  terminates  either  providing  the  isomorphism, 
or  indicating  that  there  is  no  isomorphism  between  the 
object  graph  and  the  model  graphs. 

The  proposed  label  graph  isomorphism  algorithm  has  been 
implemented  on  a VAX  11/750  machine  at  the  Center  for 
Information  Research.  We  have  successfully  applied  the 
algorithm  to  two  engineering  problems  which  can  be 
formulated  as  the  label  graph  isomorphism  problem.  They 
are;  (1)  the  matching  and  identification  of  circuit  diagrams 
with  PC  boards  (Tou  and  Fan  [1989]),  and  (2)  the 
representation  of  pictorial  drawings  for  3-D  recognition 
(Tou  and  Fan  [1988],  Fan  and  Tou  [1988]). 

6.1  Matching  and  Identification  of  Circuit  Diagrams 

with  PC  Boards 

The  circuit  diagram  is  a symbolic  representation  of  a 
circuit/system  design  which  is  used  as  a communication  tool 
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for  expressing  the  design  ideas  and  results  in  a pictorial 
form.  The  PC  board  of  a circuit  diagram  is  a physical 
representation  of  a circuit/system  design  which  describes 
the  actual  layouts  of  components  and  wirings  in  the  design. 
Geometrically,  the  circuit  schematic  and  its  PC  board  layout 
are  unrelated.  The  position  of  a symbolic  element  in  a 
circuit  diagram  is  totally  un-correlated  with  the  location 
of  the  corresponding  physical  component  in  the  PC  board. 
The  arrangements  of  symbolic  elements  in  a circuit  diagram 
are  made  to  enhance  its  neat  appearance.  The  layout  of 
physical  components  and  wirings  in  the  PC  board  is  dictated 
by  physical  constraints  and  electrical  properties. 

The  problem  of  determining  whether  or  not  two  circuit 
diagrams  (or  one  circuit  diagram  and  one  PC  board)  are 
identical  is  the  same  as  the  graph  isomorphism  problem, 
considerably  simplified  by  the  labels  of  the  edges.  The 
circuit  diagram  and  PC  board  can  be  represented  by  their 
corresponding  representative  graphs.  Finding  a unique  set 
of  invariants  for  a graph  implies  the  solution  of  the 
isomorphism  problem  as  well,  because  two  graphs  would  be 
isomorphic  if  and  only  if  their  invariants  were  the  same. 

6.1.1  Graph  Model inq 

Prior  to  the  matching  and  identification  process,  both 
circuit  diagram  and  PC  board  image  are  converted  to  graph 
representations  by  the  computer.  We  make  use  of  the  AUTORED 
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system,  which  we  developed  seven  years  ago  (Tou  and  Cheng 
[1983]),  to  read  the  circuit  diagram.  The  AUTORED 
identifies  nodes,  recognizes  symbolic  elements,  and  extracts 
connection  lines  from  the  circuit  diagram.  The  data 
generated  by  AUTORED  are  used  to  convert  the  circuit  diagram 
to  a graph  which  is  referred  to  as  C-graph.  Via  image 
processing  and  interpretation,  the  computer  reads  the  PC 
board  and  identifies  its  physical  components,  connecting 
wires,  and  pads.  This  information  is  used  to  convert  the  PC 
board  image  to  a graph  which  is  referred  to  as  P-graph.  The 
matching  and  identification  process  is  performed  on  the 
graph  representation. 

The  process  of  converting  from  a circuit  diagram  or  a PC 
board  to  its  equivalent  graph  representation  is  called  the 
graph  modeling.  The  principle  of  graph  modeling  for  circuit 
diagrams  and  PC  boards  is  the  same.  Here,  we  use  circuit 
diagram  as  the  example  to  illustrate  the  graph  modeling 
process. 

A circuit  diagram  is  represented  as  conjunctions  of 
logical  predicates  which  have  element  names  and  nodes  as 
terms.  It  is  formed  by  the  interconnection  of  various  types 
of  electrical  elements.  We  consider  a circuit  diagram  to  be 
a connected  graph  whose  edges  are  composed  of  electrical 
elements;  this  means  that  each  element  has  accessible 
terminals  by  means  of  which  it  may  be  joined  with  other 
elements.  Figure  6.1(a)  illustrates  a circuit  diagram.  In 
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this  figure,  there  are  many  element  nodes  which  are 
electrically  equivalent  and  should  be  merged  into  node 
groups  to  reduce  the  number  of  primitives  in  the  circuit 
diagram.  A node  group  consists  of  several  equivalent 
element  nodes  with  the  same  electrical  properties  such  as 
voltage  and  current  in  the  circuit  diagram. 

The  rules  to  merge  equivalent  element  nodes  into  node 
group  are  as  follows: 

Rule  1:  IF  there  is  no  element  connecting  between  element 

node  i and  element  node  j > THEN  element  node  i and 

element  node  j are  equivalent. 

Rule  2:  IF  element  node  i and  element  node  j are  equivalent 

AND  (Xi  < Xj)  n (Yi  < Yj)  is  TRUE > THEN  element  node  i 

is  the  representative  node  of  the  node  group  and  (X,Y)  = 

(Xf,Yi)  , where  (Xj^,Yj^)  is  the  coordinate  of  element  node  i 
and  (X,Y)  is  the  coordinate  of  the  representative  node  of 
the  merged  node  group. 

Applying  rule  1 and  rule  2 iteratively  to  a circuit 
diagram  can  merge  all  equivalent  element  nodes  into  node 
groups.  The  merging  process  is  terminated  by  labeling  the 
representative  nodes  of  the  generated  node  groups  according 
to  the  scanning  order  from  top  to  bottom  and  from  left  to 
right.  After  the  first  pass  is  over,  the  equivalent  classes 
are  easily  obtained  by  merging  all  element  nodes  in  the  same 
group.  At  the  same  time,  new  numbers  are  stored  in  the 
group  number  array  in  order  to  label  correct  group  numbers 
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in  the  second  pass.  Referring  to  the  circuit  diagram  in 
Figure  6.1(a),  the  result  of  the  first  pass  is  illustrated 
in  Figure  6.1(b),  which  shows  that  element  nodes  2,  3 , and 
5 have  been  merged.  In  Figure  6.1(b),  the  nodes  in  circles 
are  the  representative  nodes  of  the  merged  node  groups.  The 
label  of  each  representative  node  is  assigned  to  all  element 
nodes  belonging  to  the  same  node  group.  By  assigning  the 
same  group  number  to  them  and  adjusting  others,  we  can 
obtain  the  result  as  shown  in  Figure  6.1(c). 

If  node  groups  of  a circuit  diagram  are  chosen  as 
primitives,  each  vertex  may  represent  properties  of  the  node 
group,  and  each  edge  may  represent  spatial  relationships 
between  node  groups.  In  other  words,  the  circuit  diagram 
can  be  thought  of  as  a graph  with  the  vertices  representing 
the  node  groups  and  the  edges  representing  the  elements 
connecting  the  node  groups.  In  this  graph,  we  represent  the 
vertices  as  small,  disjunctive  circles  in  the  plane.  For 
each  edge  of  the  graph  we  draw  a line  from  the  initial 
vertex  to  the  terminal  vertex  of  the  edge  and  write  the 
element  name  of  the  edge  beside  it.  An  example  of  such  a 
graph  is  shown  in  Figure  6.1(d),  which  is  the  generated  C- 
graph  of  the  merged  circuit  diagram  shown  in  Figure  6.1(c). 

Figure  6.2  is  an  example  illustrating  the  principle  of 
graph  modeling  for  a PC  board.  Figure  6.2(b)  is  the  result 
with  rule  1 and  rule  2 applying  to  Figure  6.2(a)  and  Figure 
6.2(c)  is  the  result  with  labeling  procedure  applying  to 
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(a) 


(b) 

Figure  6.1  a)  Circuit  Diagram  1,  b)  Result  after  Node 
Merging,  c)  Result  after  Node  Relabeling, 
d)  C-graph  1. 
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5 


(d) 

Figure  6.1  (Continued). 
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(a) 


(b) 

Figure  6.2  a)  A PC  Board,  b)  Result  after  Node  Merging, 

c)  Result  after  Node  Relabeling,  d)  The  P-graph 
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d>  (D 


(C) 


(d) 


Figure  6.2 


(Continued) . 
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Figure  6.2(b).  The  P-graph  of  the  merged  PC  board  is 
illustrated  in  Figure  6.2(d). 

6.1.2  Graph  Representation 

The  P-graph  or  C-graph  generated  by  the  graph  modeling 
process  may  be  uniquely  specified  by  its  associated 
adjacency  matrix  and  edge  adjacency  matrix. 

The  adjacency  matrix  of  a graph  suggests  a method  of 
representing  any  binary  relation  on  a finite  set.  A binary 
relation  on  a finite  set  V of  order  n is  defined  to  be  a 
subset  of  the  set  of  all  ordered  pairs  of  elements  of  V. 
The  pair  (vj^,Vj)  belongs  to  the  subset  i.e.,  to  the 
relation,  if  and  only  if  Vj^  bears  the  relation  to  Vj . We 
can  represent  the  set  V as  the  vertices  of  a graph  in  which 
each  ordered  pair  (vj^,Vj)  of  the  relation  is  represented  by 
an  edge  from  the  vertex  corresponding  to  Vj^  to  the  vertex 
corresponding  to  Vj . In  this  graph  there  is  at  most  one 
edge  from  one  vertex  to  another.  We  can  now  describe  both 
the  relation  and  the  associated  graph  by  means  of  an 
adjacency  matrix  A of  order  n: 

A = [aij]n  (6-1) 


such  that 
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r 1 if  (vi,vj)  belongs  to  the  relation 
^ 0 otherwise  (6-2) 

In  essence,  an  adjacency  matrix  A defines  all  the 
properties  of  the  corresponding  relation.  The  adjacency 
matrix  is  symmetric  since  if  for  each  pair  (vj^,vj)  belonging 
to  it,  the  pair  (Vj,Vj^)  also  belongs  to  the  relation.  For 
instance,  consider  the  C-graph  and  P-graph  as  shown  in 
Figures  6.1(d)  and  6.2(d),  their  adjacency  matrices  are 
shown  in  Tables  6.1  and  6.2  respectively. 

As  to  the  generation  of  the  edge  adjacency  matrix,  let 
us  now  look  at  the  C-graph  such  as  is  illustrated  in  Figure 
6.1(d),  in  which  C,  R,  and  T denote  components  "Capacitor", 
"Resistor",  and  "Transistor"  respectively.  All  the  vertices 
in  the  graph  together  constitute  a finite  set  and  the 
condition  that  certain  pairs  of  them  be  connected  when  a 
given  component  is  a relation  on  this  set,  associated  with 
that  particular  component  symbol.  That  is,  if  there  are  n 
vertices,  then  for  each  component  symbol  x of  the  graph, 
there  exists  an  n x n relational  matrix 

rX  = [rijX]  (6-3) 


such  that 
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Table  6.1  Adjacency  Matrix  for  C-graph  1. 


1 2 3 4 5 6 


1 

2 

3 

4 

5 

6 


0 

1 

1 

0 

0 

0 


1 

0 

1 

1 

0 

0 


1 

1 

0 

1 

0 

1 


0 

1 

1 

0 

1 

1 


0 

0 

0 

1 

0 

1 


0 

0 

1 

1 

1 

0 


Table  6.2  Adjacency  Matrix  for  P-graph. 
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r 1 if  i and  j are  connected  via  component  x 
^ 0 otherwise  (6-4) 


For  example,  in  the  case  of  the  C-graph  of  Figure  6.1(d),  we 
have 


rX  = 


0 1 
1 0 
0 0 
0 0 
0 0 
0 0 


0 0 
0 0 
0 0 
0 0 
0 1 
0 0 


0 0 
0 0 
0 0 
1 0 
0 0 
0 0 


since  components  are  bilateral  devices,  these  relational 
matrices  are  always  symmetric. 

The  edge  adjacency  matrix  of  the  graph  may  now  be 
defined  as  a weighted  sum  of  these  relational  matrices.  If 
Xj^  (i=l,2,  • • • ,r)  are  the  components,  this  sum  is  defined  to 
be 

P = SiXiR^i  1 < i < r (6-5) 

where  r is  the  number  of  distinct  component  types  in  the 
circuit  diagram.  The  ij  entry  of  xR^  is  defined  to  be  xr^j 
if  i 7^  j , and  i if  i = j . 
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In  the  case  of  our  example,  we  have 


P = CR^  + RR^  + TR'^  = 


0 

C 

R 

0 

0 

0 


C 

0 

T 

T 

0 

0 


R 

T 

0 

T 

0 

R 


0 

T 

T 

0 

C 

R 


0 

0 

0 

C 

0 

R 


0 

0 

R 

R 

R 

0 


which  can  be  determined  directly  from  the  graph  by 
inspection.  The  symbol  is  the  "inclusive  or"  operation 

of  Boolean  algebra. 

According  to  the  above  procedure,  the  edge  adjacency 
matrix  of  the  P-graph  shown  in  Figure  6.2(d)  is  generated 
and  shown  as  follows: 


0 0 0 C 
0 0 C T 
0 C 0 0 
C T 0 0 
R T 0 T 
0 R R 0 


R 0 
T R 
0 R 
T 0 
0 R 
R 0 


where  the  ij^^  element  is  x if  there  exists  an  element  of 
type  X from  vertex  i to  vertex  j,  and  is  0 otherwise. 


6.1.3  Illustration  Example 

Our  approach  to  the  matching  and  identification  of 
circuit  diagrams  with  PC  boards  consists  of  two  phases;  the 
isomorphism  test  and  the  vertex  partitioning.  The 
isomorphism  test  serves  as  a screening  process  to  select 
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candidate  circuit-PCB  pairs.  The  vertex  partitioning  serves 
as  an  identification  process  to  ensure  the  correct  circuit- 
PCB  pair  and  to  determine  the  correspondence  between  nodes 
in  the  circuit  diagram  and  the  pads  in  the  PC  board.  The 
isomorphism  test  and  vertex  partitioning  are  carried  out 
based  on  the  computation  and  comparison  of  the  graph 
invariants  and  vertex  invariants.  The  derivation  and 
description  of  each  invariant  have  been  discussed  in  the 
previous  chapters.  Here  we  won't  explain  in  detail  how  all 
the  invariants  are  generated.  We  only  discuss  the  employed 
invariants  for  the  circuit-PCB  matching  and  identification 
and  provide  example  to  illustrate  the  matching  and 
identification  process. 

The  isomorphism  test  is  conducted  to  select  candidate 
circuit-PCB  pairs.  The  graph  invariants  used  for  the 
isomorphism  test  are  (1)  number  of  vertices,  (2)  number  of 
edges,  (3)  degree  distribution,  (4)  connectivity  code,  (5) 
canonical  code,  and  (6)  canonical  string.  The  computer 
extracts  these  graph  invariants  from  both  C-graph  and  P- 
graph  and  conducts  the  isomorphism  test  based  on  these 
extracted  graph  invariants.  The  isomorphism  test  provides 
the  necessary  condition  for  the  identification  of  candidate 
circuit-PCB  pairs. 

The  vertex  partitioning  is  conducted  to  determine  the 
correct  circuit-PCB  pairs.  The  vertex  invariants  used  for 
the  vertex  partitioning  are  (1)  edge  adjacency  vector,  (2) 
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connectivity  vector,  and  (3)  relaxation  partitioning.  The 
vertex  partitioning  provides  the  sufficient  condition  for 
the  determination  of  correct  circuit-PCB  pairs.  In  a 
circuit-PCB  pair,  the  correspondence  between  nodes  in  the 
circuit  diagram  and  the  pads  in  the  PC  board  is  also 
identified  in  the  vertex  partitioning  process. 

To  illustrate  the  circuit-PCB  matching  and 
identification  process,  let  us  consider  the  following 
example.  Two  circuit  diagrams  are  shown  in  Figures  6.1(a), 
6.3(a),  6.4(a),  and  6.5(a)  and  a PC  board  is  shown  in 
Figure  6.2(a).  Following  our  approach,  the  circuit  diagrams 
and  the  PC  board  are  converted  to  C-graphs  and  P-graph,  as 
shown  in  Figures  6.1(d),  6.3(b),  6.4(b),  and  6.5(b)  and 

Figure  6.2(d),  respectively.  To  conduct  the  isomorphism 
test,  the  computer  generates  seven  graph  invariants  and 
perform  the  matching  process  based  on  these  generated  graph 
invariants . 

Applying  Algorithm  2-2  to  this  example,  the  process  is 
illustrated  in  Table  6.3.  The  analysis  based  on  the 
generated  graph  invariants  reveals  that  circuit  1 and  the 
PCB  is  a candidate  pair  and  circuits  2,  3,  4 and  the  PCB  are 
not.  Then,  we  perform  the  vertex  partitioning  on  circuit  1 
and  the  PCB.  The  computer  extracts  the  component-related 
vertex  invariants.  The  vertex  partitioning  based  on  the 
generated  vertex  invariants  has  identified  that  circuit 
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(a) 


1 


(b) 


Figure  6.3  a)  Circuit  Diagram  2,  b)  C-graph  2. 
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(a) 


1 


(b) 


Figure  6.4  a)  Circuit  Diagram  3,  b)  C-graph  3. 
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(a) 


1 


(b) 


Figure  6.5  a)  Circuit  Diagram  4,  b)  C-graph  4. 
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Table  6.3  Isomorphism  Test  and  Vertex  Partitioning  Process. 


ISOMORPHISM  TEST 


Graph 

Number  of  Vertices 

C-graph  1 

6 

C-graph  2 

6 

C-graph  3 

6 

C-graph  4 

6 

P-graph 

6 

Graph 

Number  of  Edges 

C-graph  1 

9 

C-graph  2 

9 

C-graph  3 

9 

C-graph  4 

9 

P-graph 

9 

Graph 

Degree  Distribution 

C-graph  1 

(2,2,2) 

C-graph  2 

(2,2,2) 

C-graph  3 

(2,2,2) 

* C-graph  4 

(1,3,2) 

P-graph 

(2,2,2) 

Graph 

Connectivity  Code 

C-graph  1 
C-graph  2 
* C-graph  3 
P-graph 

(221,221,320,320,410,410) 

(221.221.320.320.410.410) 

(230.230.320.320.410.410) 

(221.221.320.320.410.410) 
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Graph 

Canonical  Code 

C-graph  1 

(30001011,110000110,110100001, 

102010011,22110101,112101110) 

C-graph  2 

(110000110,110001001,102010011, 

120100011,112101101,112011110) 

P-graph 

(30001011,110000110,110100001, 

102010011,22110101,112101110) 

Graph 

Canonical  String 

C-graph  1 

3R3X1C1R2X1C1R2X1C2T3X2R2T4X1C1R2T4X 

* C-graph  2 

1C1R2X1C1R2X1C2T3X1C2R3X1C1R2T4X1C1R2T4X 

P-graph 

3R3X1C1R2X1C1R2X1C2T3X2R2T4X1C1R2T4X 

C 

R 

T 

6 

1 

5 

2 

3 

4 

C 

R 

T 

6 

1 

3 

4 

5 

2 

C 

0 

0 

0 

0 

0 

0 

0 

0 

0 

c 

0 

0 

0 

0 

0 

0 

0 

0 

0 

R 
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diagram  1 is  a correct  match  of  the  PCB  and  the  node-pad 
correspondence  between  circuit  diagram  1 and  the  PCB  is; 


Cg,Ci,C5,C2/C3, C4 

P6/Pl/P3'P4»P5/P2 

where  Cj^  denotes  the  i^^  vertex  of  the  circuit  diagram  and 
Pj^  denotes  the  i^^  vertex  of  the  PCB. 

6.2  Representation  of  Pictorial  Drawings  for  3-D  Recognition 

The  application  of  the  graph  isomorphism  algorithm  to 
the  representation  of  pictorial  drawings  for  3-D  recognition 
is  presented  in  this  section.  A 3-D  object  can  be 
characterized  by  its  pictorial  drawings  which  are  generated 
from  the  intensity  images  taken  from  the  object.  Thus,  the 
recognition  of  pictorial  drawings  can  serve  as  the  basis  for 
3-D  recognition. 

Graph  isomorphism  has  been  used  to  express  the  basic 
structural  similarity  between  two  pictorial  drawings 
(Roberts  [1965],  Guzman  [1967],  Shapiro  and  Haralick 
[1982]).  Two  pictorial  drawings  are  isomorphic  if  there 
exists  a one-to-one  correspondence  between  the  vertices  of 
them  that  preserves  the  adjacency  of  vertices. 

In  order  to  uniquely  represent  a pictorial  drawing,  a 
set  of  features  including  graph  invariants  and  vertex 
invariants  are  employed.  Since  these  features  are  the 
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invariants  of  pictorial  drawings,  they  are  applicable  to  the 
representation  and  recognition  of  pictorial  drawings  as  long 
as  the  primitives  and  relationships  represented  have 
invariant  features.  The  features  of  prototype  pictorial 
drawings  are  called  stored  models  and  are  used  as  part  of 
the  knowledge  base  of  a recognition  system.  Such  a system 
inputs  an  object  pictorial  drawing,  computes  its  features, 
and  tries  to  identify  it  with  a stored  model.  Thus,  instead 
of  asking  whether  two  pictorial  drawings  match  each  other, 
we  will  only  ask  whether  the  features  of  the  object 
pictorial  drawing  match  the  features  of  the  prototype 
pictorial  drawing. 

6.2.1  Graph  Modeling 

A pictorial  drawing  may  be  treated  as  a graph  in  which 
the  nodes  corresponding  to  the  vertices  and  the  branches  to 
the  edges.  To  illustrate  this,  let  us  consider  the 
pictorial  drawing  as  shown  in  Figure  6.6(a).  The  graph 
represented  for  the  pictorial  drawing  is  illustrated  in 
Figure  6.6(b) . 

6.2.2  Graph  Representation 

Structural  information  are  contained  in  the  vertex  of 
the  pictorial  drawings.  Such  structural  information  may  be 
captured  by  vertex  types.  A vertex  can  be  classified 
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1 2 


7 8 


(a) 


1 2 


7 8 


(b) 


Figure  6.6  a)  Model  Pictorial  Drawing  1,  b)  Model  Pictorial 
Graph  1. 
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according  to  the  geometrical  configuration  of  its  incident 
edges.  The  vertex  types  considered  here  are  illustrated  in 
Figure  6.7  and  described  below. 

V:  formed  by  the  corner  of  a surface  boundary 
Y:  formed  by  the  intersection  of  three  visible  surfaces 
A:  formed  by  the  intersection  of  two  visible  surfaces 
T:  formed  by  the  intersection  with  an  occluded  surface 
F:  formed  by  the  intersection  of  several  visible  surfaces  at 
a common  vertex 

K:  formed  by  partial  overlapping  of  two  surfaces 

The  graph  generated  by  the  graph  modeling  process  may  be 
uniquely  specified  by  its  associated  adjacency  matrix  and 
vertex  type  table.  The  generation  of  vertex  type  table  is 
readily  accomplished  by  assigning  the  associated  vertex  type 
to  each  corresponding  vertex.  For  example,  referring  to  the 
graph  in  Figure  6.6(b),  its  associated  adjacency  matrix  and 
vertex  type  table  are  shown  in  Tables  6.4  and  6.5 
respectively. 

6.2.3  Illustration  Example 

Two  pictorial  drawings  may  be  distinguished  from  their 
associated  features  which  forms  the  basis  for  model 
matching.  Recognition  problem  is  then  treated  as  a 
matching  problem:  matching  the  features  of  an  object  with 

the  features  of  a stored  model.  By  matching  features 
instead  of  simply  correlating  whole  pictorial  drawings,  we 


199 


Figure  6.7  Vertex  Type  Dictionary. 
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Table  6.4  Adjacency  Matrix  for  Model  Pictorial  Drawing  1. 


12345678 


1 

2 

3 

4 

5 

6 

7 
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0 

1 

1 

0 

0 

0 

0 

0 


1 

0 

0 

1 

1 

0 

0 

0 


1 

0 

0 

1 

0 

1 

0 

0 


0 

1 

1 

0 

0 

0 

0 

1 


0 

1 

0 

0 

0 

0 

0 

1 


0 

0 

1 

0 

0 

0 

1 

0 


0 

0 

0 

0 

0 

1 

0 

1 


0 

0 

0 

1 

1 

0 

1 

0 


Table  6.5  Vertex  Type  Table  for  Model  Pictorial  Drawing  1. 


Vertex 

Vertex  Type 

1 

V 

2 

W 

3 

Y 

4 

Y 

5 

V 

6 

V 

7 

V 

8 

W 
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can  make  the  matching  process  invariant  under  specific 
rotations,  or  under  spatial  rearrangements  that  preserve 
adjacency  or  connectivity  between  vertices. 

Our  approach  to  the  recognition  of  pictorial  drawings 
consists  of  two  phases;  the  isomorphism  test  and  the  vertex 
partitioning.  The  isomorphism  test  serves  as  a screening 
process  to  select  candidate  pictorial  drawings.  The  vertex 
partitioning  serves  as  an  identification  process  to  ensure 
the  match  and  to  determine  the  vertex  correspondence  between 
the  object  pictorial  drawing  and  the  match  pictorial 
drawing. 

The  isomorphism  test  is  conducted  to  select  candidate 
pictorial  drawings.  The  features  used  for  the  isomorphism 
test  are  (1)  number  of  vertices,  (2)  number  of  edges,  (3) 
connectivity  code,  (4)  canonical  code,  and  (5)  canonical 
string.  The  computer  extracts  these  features  from  both 
object  pictorial  drawing  and  model  pictorial  drawing  and 
conducts  the  isomorphism  test  based  on  these  extracted 
features.  The  isomorphism  test  provides  the  necessary 
condition  for  the  identification  of  candidate  pictorial 
drawings. 

The  vertex  partitioning  is  conducted  to  determine  the 
one-to-one  vertex  correspondence  between  the  object 
pictorial  drawing  and  the  candidate  pictorial  drawing  to 
ensure  the  correct  match.  The  features  used  for  the  vertex 
partitioning  are  (1)  vertex  type,  (2)  surrounding  simple 
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cycle  number,  (3)  simple  cycle  length  vector,  (4)  vertex 
label  simple  cycle  vector,  and  (5)  relaxation  partitioning. 
The  vertex  partitioning  provides  the  sufficient  condition 
for  the  determination  of  match  pictorial  drawing.  In  an 
object-match  pictorial  drawing  pair,  the  correspondence 
between  the  vertices  in  the  object  pictorial  drawing  and  the 
vertices  in  the  match  pictorial  drawing  is  also  identified 
in  the  vertex  partitioning  process. 

To  illustrate  the  pictorial  drawing  recognition  process, 
let  us  consider  the  following  example.  Two  model  pictorial 
drawings  are  illustrated  in  Figures  6.6(a),  6.8(a),  6, 9 (a), 
and  6.10(a)  and  an  object  pictorial  drawing  is  illustrated 
in  Figure  6.11(a).  Following  our  approach,  the  model 
pictorial  drawings  and  the  object  pictorial  drawing  are 
converted  to  the  graph  forms  as  shown  in  Figures  6.6(b), 
6.8(b),  6.9(b),  and  6.10(b)  and  Figure  6.11(b), 
respectively.  To  conduct  the  isomorphism  test,  the  computer 
generates  the  graph  invariants  and  perform  the  matching 
process  based  on  these  generated  graph  invariants. 

Applying  Algorithm  2-1  to  this  example,  the  process  is 
illustrated  in  Table  6.6.  The  analysis  based  on  the 
generated  graph  invariants  reveals  that  model  pictorial 
drawing  1 and  the  object  pictorial  drawing  is  a candidate 
pair  and  model  pictorial  drawing  2,  3,  4 and  the  object 
pictorial  drawing  are  not.  Then,  we  perform  the  vertex 
partitioning  on  model  pictorial  drawing  1 and  the  object 
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1 2 


(a) 


1 2 


(b) 


Figure  6.8  a)  Model  Pictorial  Drawing  2,  b)  Model  Pictorial 
Graph  2 . 
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1 


(a) 


1 


Figure  6.9  a)  Model  Pictorial  Drawing  3,  b)  Model  Pictorial 
Graph  3 . 
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Figure 


1 


(a) 


1 


.10  a)  Model  Pictorial  Drawing  4,  b)  Model  Pictorial 
Graph  4 . 
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1 2 


7 8 


(a) 


1 2 


(b) 


Figure  6.11  a)  Object  Pictorial  Drawing, 
Pictorial  Graph. 


Object 
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Table  6.6  Isomorphism  Test  and  Vertex  Partitioning  Process. 


ISOMORPHISM  TEST 


Graph 

Number  of  Vertices 

Model  1 

8 

Model  2 

8 

Model  3 

8 

* Model  4 

9 

Object 

8 

Graph 

Number  of  Edges 

Model  1 

10 

Model  2 

10 

* Model  3 

11 

Object 

10 

Graph 

Canonical  Code 

Model  1 

(1000000011,1000000101,1000010010,1000100100, 

100000111,101011000,10010101000,10011001000) 

Model  2 

(1000000101,1000010001, 1000000110,1000100001, 
100001101,11000001010,10000011100,10010100001) 

Object 

(1000000011,1000000110,1010000001,1011000000, 

110000011,100101100,10001011000,10000110000) 

Graph 

Canonical  String 

Model  1 

1V2X1V2X1V2X1V2X1Y3X1Y3X1A3X1A3X 

* Model  2 

1V2X1V2X1V2X1V2X1Y3X1A3X1A3X1A3X 

Object 

1V2X1V2X1V2X1V2X1Y3X1Y3X1A3X1A3X 
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VERTEX  PARTITIONING 


Vertex 

Vertex  Type 
for  Model  1 

Vertex  Type 
for  Object 

1 

V 

A 

2 

A 

V 

3 

Y 

Y 

4 

Y 

Y 

5 

V 

V 

6 

V 

V 

7 

V 

A 

8 

A 

V 

(m2, ms) / (mi, ms, mg, m7) , (m3,m4) 
(01,07) , (02,05,05,03) , (03,04) 


Vertex 

Surrounding  Simple  Cycle 
Number  for  Model  1 

Surrounding  Simple  Cycle 
Number  for  Object 

1 

1 

2 

2 

2 

1 

3 

2 

3 

4 

3 

2 

5 

1 

1 

6 

1 

1 

7 

1 

2 

8 

2 

1 

(m2, mg) , (mi, ms, mg, m7) ,m3,m4 
(oi,o?) , (02,05,05,03) ,04,03 
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Vertex 

Simple  Cycle  Length 
Vector  for  Model  1 

Simple  Cycle  Length 
Vector  for  Object 

1 

(0,1,0) 

(0,2,0) 

2 

(0,2,0) 

(0,1,0) 

3 

(0,1,1) 

(0,2,1) 

4 

(0,2,1) 

(0,1,1) 

5 

(0,1,0) 

(0,1,0) 

6 

(0,0,1) 

(0,0,1) 

7 

(0,0,1) 

(0,1,1) 

8 

(0,1,1) 

(0,0,1) 

07,01, (05,03) , (02,05) ,04,03 


Vertex 

Vertex  Label  Simple  Cycle 
Vector  for  Model  1 

1 

(1,1,2) 

2 

(1,1,2) , (2,1,1) 

3 

(1,1,2) , (2,2,1) 

4 

(1,1,2) , (2,1,1) , (2,2,1) 

5 

(2,1,1) 

6 

(2,2,1) 

7 

(2,2,1) 

8 

(2,1,1) , (2,2,1) 

Vertex 

Vertex  Label  Simple  Cycle 
Vector  for  Object 

1 

(1,1,2) , (2,1,1) 

2 

(1,1,2) 

3 

(1,1,2) , (2,1,1) , (2,2,1) 

4 

(1,1,2) , (2,2,1) 

5 

(2,1,1) 

6 

(2,2,1) 

7 

(2,1,1) , (2,2,1) 

8 

(2,2,1) 

mg, m2, (mg, my) ,mi,m5,m3,m4 
Oy,Oi, (05,03) ,02,05,04,03 
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AMBIGUITY  RESOLUTION 


Cell  Index 


I 

II 

III 

IV 

V 

VI 

VII 


Vertex  of 
Model  1 
8 

2 

6 

7 

1 

5 

3 

4 


List 

(0,0, 1,0, 1,0,1) 

(0,0, 0,1, 1,0,1) 

(0,0, 1,0, 0,1,0) 
(1,0, 1,0, 0,0,0) 

(0,1, 0,0, 0,1,0) 

(1,1, 0,0, 0,0,0) 

(0,0, 1,1, 0,0,1) 

(1,1, 0,0, 0,1,0) 


Cell  Index 

I 

II 

III 

IV 

V 

VI 

VII 

VIII 


Vertex  of 
Model  1 
8 
2 
6 
7 
1 
5 

3 

4 
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Cell  Index 

I 

II 

III 

IV 

V 

VI 

VII 


Vertex  of 
Object 

7 

1 

6 

8 

2 

5 

4 

3 


(0,0 

(0,0 

(0,0 

(1,0 

(0,1 

(1,1 

(0,0 

(1,1 


Cell  Index 

I 

II 

III 

IV 

V 

VI 

VII 

VIII 


Vertex  of 
Object 

7 
1 
6 

8 
2 
5 
4 
3 


’"'>8  ' ^^2  ' ' ^7  ' ™1 ' ^5 ' ’'"3  ' ^4 

07,02,06,03,02,05,04, 03 


List 

1,0, 1,0,1) 

0,1, 1,0,1) 

1,0, 0,1,0) 

1,0, 0,0,0) 

0,0, 0,1,0) 

0, 0, 0, 0, 0) 

1,1, 0,0,1) 

0,0, 0,1,0) 
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pictorial  drawing.  The  computer  extracts  seven  component- 
related  vertex  invariants.  The  vertex  partitioning  based  on 
the  generated  vertex  invariants  has  identified  that  model 
pictorial  drawing  1 is  a correct  match  of  the  object 
pictorial  drawing  and  the  vertices  of  model  pictorial 
drawing  1 matches  with  the  vertices  of  the  object  pictorial 
drawing  as  follows: 


ms , m2  , ni6 , my , mi , m5 , m3 , m4 

07,01,06,08,02,05,04, 03 


where  m^  denotes  the  i^^  vertex  of  the  model  pictorial 
drawing  and  o^  denotes  the  i"*^^  vertex  of  the  object 
pictorial  drawing. 


CHAPTER  VII 

SUMMARY  AND  CONCLUSIONS 
7 . 1 Summary 

In  this  dissertation,  the  strategy  of  the  proposed 
feature-oriented  label  graph  isomorphism  algorithm  was 
specified,  and  various  feature-oriented  invariants  were 
analyzed  and  developed.  In  comparison  with  the  previous 
research,  this  study  incorporates  vertex  and  edge  features 
in  the  computation  of  the  invariants  with  the  aim  of  solving 
the  label  graph  isomorphism  problem  more  effectively. 

In  Chapter  I,  we  investigated  the  limitation  of  the 
non-label  graph  isomorphism  algorithm  in  solving  the  label 
graph  isomorphism  problem  and  addressed  that  the  need  for  an 
efficient  label  graph  isomorphism  algorithm  had  been 
recognized.  Then  we  surveyed  the  research  progress  in  the 
past  decade  and  found  that  graph  isomorphism  problem  is 
still  a problem  yet  to  be  resolved.  We  believed  that 
research  in  this  direction  should  be  encouraged. 

In  Chapter  II,  we  proposed  the  feature-oriented  label 
graph  isomorphism  algorithm  for  solving  the  label  graph 
isomorphism  problem  and  characterized  the  role  of  vertex  and 
edge  features  in  the  algorithm.  The  developed  graph 
isomorphism  algorithm  consisting  of  the  isomorphism  test  and 
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vertex  partitioning  was  especially  useful  when  testing 
several  graphs  for  isomorphism.  Vertex  features  and  edge 
features  were  utilized  in  the  computation  of  the  invariants 
with  the  aim  of  finding  candidate  graphs  in  the  isomorphism 
test  and  identifying  match  graph  in  the  vertex 
partitioning. 

In  Chapter  III,  the  focus  was  on  the  cycle  structure  of 
a graph.  A simple  cycle  extraction  algorithm  was  introduced 
for  the  purpose  of  extracting  simple  cycles  from  a graph. 
The  extracted  simple  cycles  of  the  graph  were  then  stored  in 
the  simple  cycle  matrix.  Determination  of  the  surrounding 
simple  cycle  vector  and  simple  cycle  invariants  of  a given 
vertex  were  based  on  the  simple  cycle  matrix.  A cycle 
extraction  algorithm  was  then  introduced  to  extract  the 
cycles  surrounding  a given  vertex  with  an  emphasis  to  reduce 
the  complexity  of  the  algorithm.  Determination  of  the 
surrounding  cycle  vector  of  a given  vertex  was  based  on  the 
extracted  cycles  surrounding  the  vertex.  The  simple  cycle 
invariants  and  cycle  invariants  were  selected  as  the 
invariants  upon  which  comparisons  among  vertices  should  be 
based.  The  former  provides  important  information  for  the 
graphs  under  rotation,  translation,  or  scale  changes,  the 
latter  provides  important  information  for  the  graphs  under 
isomorphism. 

The  canonical  encoding  algorithm  presented  in  Chapter  IV 
was  based  on  the  permutations  of  rows  and  columns  on  the 
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modified  adjacency  matrix  of  the  graph.  By  use  of  both  the 
connectivity  and  the  features  of  each  vertex  and  each  edge 
in  the  representative  graph,  the  modified  adjacency  matrix 
is  reduced  to  a canonical  representation.  This  canonical 
representation  is  unique  for  each  distinct  graph 
represented. 

In  Chapter  V,  we  developed  and  derived  the  invariants 
based  on  the  connectivity  analysis  of  a graph.  The 
connectivity  structure  plays  the  most  important  role  in 
characterizing  a graph.  The  invariants  indicating  the 
connectivity  of  a graph  were  effectively  used  either  as 
tools  in  the  isomorphism  test  or  as  prime  criteria  for  the 
vertex  partitioning  to  resolve  ambiguity. 

In  Chapter  VI,  the  focus  was  on  the  applications  of  the 
proposed  graph  isomorphism  algorithm.  Two  applications  were 
reported,  which  were  (1)  the  matching  and  identification  of 
circuit  diagrams  with  PC  boards,  and  (2)  the  representation 
of  pictorial  drawings  for  3-D  recognition.  We  had 
successfully  employed  the  proposed  algorithm  on  these  two 
applications. 


7.2  Significance  of  this  Research 

In  addition  to  those  powerful  and  newly-proposed 
feature-oriented  invariants,  we  have  three  significant 
accomplishments  in  this  dissertation. 
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The  first  accomplishment  in  this  dissertation  is  to 
propose  a novel  approach  for  solving  the  label  graph 
isomorphism  problem. 

The  second  accomplishment  is  to  develop  an  efficient 
vertex  partitioning  scheme  which  can  effectively  partition 
the  vertex  set  into  smaller  and  smaller  subsets.  The  vertex 
partitioning  scheme  that  we  propose  is  based  on  the 
iterative  set  partitioning  technique,  while  the  previous 
works  use  a much  more  tedious  set  reduction  partitioning 
technique.  A cardinality  constraint  is  proposed  to  enhance 
the  iterative  set  partitioning  technique,  which  can  be 
viewed  as  a filter  to  eliminate  non-isomorphic  partitionings 
at  early  steps  without  wasting  time  in  partitioning  non- 
isomorphic graphs.  The  contradiction  verification 
(cardinality  violation)  of  each  partitioning  is  easily 
carried  out  by  use  of  the  cell  count  vector  and  cell  value 
vector. 

The  third  accomplishment  is  to  propose  a powerful 
ambiguity  resolution  scheme  which  guarantees  to  lead  to  the 
solution.  The  implementation  of  this  scheme  is  based  on 
the  relaxation  partitioning  technique  to  partition  the 
ambiguous  cell  set  iteratively  until  the  ambiguity  has  been 
resolved.  The  advantage  of  the  proposed  relaxation 
partitioning  technique  is  that  it  is  very  straightforward 
and  easy  to  implement,  while  the  previous  approaches  use  a 
very  tedious  scheme  which  is  difficult  to  implement. 
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7 . 3 Recommendations  for  Future  Research 

In  this  dissertation,  we  have  presented  some  research  in 
solving  the  graph  isomorphism  problem.  Although  the 
algorithm  has  shown  encouraging  results,  it  is  by  no  means 
complete.  More  research  work  is  needed  to  enhance  or 
improve  the  algorithm  especially  in  the  area  of  performance 
optimization. 

Since  this  dissertation  deals  with  the  application  of 
various  invariants  in  resolving  the  graph  isomorphism 
problem,  it  is  true  that  the  result  of  the  proposed  graph 
isomorphism  algorithm  depends  on  the  order  of  use  of  the 
various  invariants,  and  the  performance  of  each  invariant 
might  differ  with  different  graph  configurations.  To  have 
the  best  result,  the  most  useful  invariant  should  be  applied 
first.  In  this  dissertation,  the  order  of  use  of  all 
invariants  are  arbitrarily  chosen.  Consequently,  the 
result  is  not  optimal.  In  searching  for  an  optimal 
performance,  the  usefulness  index  of  each  invariant  and  the 
variations  of  the  usefulness  index  with  different  graph 
configurations  need  to  be  considered.  Research  in  this 
direction  might  be  an  important  area  in  the  future. 
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